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Abstract 

Powder auto-indexing is the crystallographic problem of lattice determination 
from an average theta series. There, in addition to all the multiplicities, the lengths 
of part of lattice vectors cannot be obtained owing to systematic absences. As a 
consequence, solutions are not always unique. We develop a new algorithm to enu- 
merate powder auto-indexing solutions. This is a novel application of the reduction 
theory of positive-definite quadratic forms to a problem of crystallography. Our 
algorithm is proved to be effective for all types of systematic absences, using their 
newly obtained common properties. The properties are stated as distribution rules 
for lattice vectors corresponding to systematic absences on a topograph. Conway 
defined topographs for 2-dimensional lattices as graphs whose edges are associated 
with |ii| 2 , \h\ 2 , \h + h\ 2 , \h — h\ 2 , where Zi, h are lattice vectors. In our enumer- 
ation algorithm, topographs are utilized as a network of lattice vector lengths. As 
a crystal structure is a lattice of rank 3, the definition of topographs is generalized 
to any higher dimensional lattices using Voronoi's second reduction theory. The 
use of topographs allows us to speed up the algorithm. The computation time 
is reduced to 1/250-1/32, when it is applied to real powder diffraction patterns. 
Another advantage of our algorithm is its robustness to missing or false elements 
in the set of lengths extracted from a powder diffraction pattern. Conograph is the 
powder indexing software which implements the algorithm. We present results of 
Conograph for 30 diffraction patterns, including some very difficult cases. 

1 Introduction 

Lattice determination problems have been the target of much interest in mathematics. 
In particular, many mathematicians worked on lattice determination from a theta series 
in the second half of the 20th century, with the aim of providing higher dimensional 
counterexamples for the isospectral problem proposed by Kac |16j . Lattice determina- 
tion from a theta series was finally resolved in [25] . |26j . However, it is not well known in 
the mathematics community that a similar problem was being studied in crystallography 
during this period. 

The lattice determination problem in crystallography is called powder auto-indexing. 
There a set of lattice parameters is determined from a powder diffraction pattern. Pow- 
der auto-indexing is equivalent to lattice determination from an average theta series, 
as shown in Appendix [X] At present, powder diffraction is the principal technique for 
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determining the structure of materials that do not necessarily form large-sized crystals. 
A highly automated system of analyzing powder diffraction patterns is required, both 
for basic scientific research and a wide range of industrial applications, not least in the 
pharmaceutical industry. Hence, it is necessary to establish a powerful and reliable 
powder auto-indexing algorithm and software. 

In this paper, the powder auto-indexing problem is formulated and solved. This is 
regarded as a new application of the reduction theory of positive-definite quadratic forms 
to crystallography. In consideration of its application to crystallography, we mainly dis- 
cuss cases when the lattice has a rank of N = 2, 3. Similar to a theta series, information 
about the lengths of lattice vectors is extracted from an average theta series (Section 
[2]). The most notable difference is that it is very difficult to acquire the multiplicities 
of the lengths (i.e., number of lattice vectors satisfying |/| = q for fixed q £ M>o) from 
an average theta series, and only a part of the lengths can be obtained owing to the 
crystallographic phenomenon of systematic absences. 

Systematic absences have not been defined sufficiently explicitly as a mathematical 
notion. We provide a definition in Section [J] As in the International Tables [14] . 
systematic absences are classified by the triplet (G,H, U): 

• isomorphism class of crystallographic group G C O(N) k R , 

• subgroup H C G, which contains the lattice L of G, i.e., an abelian subgroup 
L ~ Jj N consisting of all the translations in G, 

• a connected component U of (M. /L) . 

Consequently, for fixed N, systematic absences are categorized by finitely many 
triplets (Proposition [4J}. When a periodic function p belongs to the type (G,H,U), 
the lengths of any I* £ T ext (G, H, U) is not extracted from the average theta series of p 
easily, where T ext (G, H, U) is a subset of the reciprocal lattice L* of L determined from 
(G,H,U). 

To date, powder auto- indexing algorithms have been studied and improved by experts 
in crystallography. Among existing software packages, Ito [10], [31], TREOR (trial-and- 
error method Q7]), and DICVOL (dichotomy method [3]) are widely used. McMaille (a 
grid search based on the Monte Carlo method [T!5]) and X-cell (dichotomy method [2Tj ) 
were developed comparatively recently. With regard to existing powder auto-indexing 
software, several considerable problems have been reported. These include: 

• existence of more than one solutions, 

• systematic absences, 

• missing or false elements in the set of extracted lengths, 

• observation errors contained in the length values, 

• large zero-point shift, 

• overlapping peaks. 

In some software, these problems are caused by limitations intended to suppress the com- 
putation time. Therefore, both computation time and success rate should be considered 
to discuss powder auto-indexing algorithms. (For example, although the simplest brute 
force grid search may obtain the highest success rate, it takes from a few hours to a few 
days to execute in monoclinic and triclinic cases.) This paper presents a method that 
can resolve each of the problems listed above. The first three are explained in more 
detail in the following discussion. The next two are resolved using the estimated error 
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Err[|Z| 2 ] of \l\ 2 (see (^dj in Section [3] and the beginning of Section!?]). The final problem 
is also resolved naturally in our algorithm (Section [53]) . 

Our three main results are as follows. First, we formulate powder auto-indexing 
as a mathematical problem, and summarize mathematical results on the cardinality of 
solutions in powder auto-indexing. This is considered the most important foundation 
for the following discussion; in general, a lattice L is not determined uniquely in N > 2, 
even if the rank N of L and all the lengths I £ L are given (Appendix [B]). On the other 
hand, the cardinality of L2 satisfying = A^ 2 is always finite in N < 4, therefore 
it is possible to enumerate all such L2 algorithmically (Appendix [C]). However, it is 
not certain whether the number of solutions is actually finite in powder auto- indexing, 
owing to systematic absences and the finite observational range. Fortunately, using 
both physical constraints and mathematical theorems, we obtain a necessary condition 
for assuming the finitcness of solutions, except for events with zero probability (Section 
[3]). As a result, it is only necessary to enumerate finitely many solutions for powder 
auto-indcxing. 

Our second result is a new algorithm to enumerate powder auto-indexing solutions 
(Sections 17.11 and 17. 2p . We prove that it works regardless of the type of systematic 
absences. Our basic idea is to use the graph whose edges are associated with a set of 
squares of lengths |Z| | 2 , \l%\ 2 , l^i+^L l^i-^| 2 f or some lattice vectors 11,1% € L* . 

In Ito's method, the parallelogram law 2(|^| 2 + |^| 2 ) = |ZJ + l^ 2 + \l* - l^ 2 is used 
to obtain Gram matrices of sublattices of rank 2 (called zones). However, simple use 
of the parallelogram law does not always provide successful results, as introduced in 
Fact [1] of Section HT2l Therefore, instead of just enumerating a set of lengths satisfying 
the parallelogram law, our algorithm utilizes graphs whose edges are associated with 
|^i| 2 , l^l 2 ; + ^1; an d l^i — as a network of lattice vector lengths. This provides 
a comprehensive method of analyzing how the two sets {\l*\ , |Z?;| , | Z* + '2 1 > K* ~~ '2 1 } 
and {|&i| 2 , {k^ 2 , \k* + k^\ , \k{ — k^} are related to each other. 

For lattices of rank 2, such a graph has already been defined by Conway, who called 
it a topograph [B], As explained in section [5.11 a graph having the required property is 
constructed for general N by using Voronoi's second reduction theory. We also call this 
a topograph. Basic properties of topographs for lattices of rank N = 2, 3 are explained 
in Section HT2"1 

In SectionHJJ we shall see how primitive vectors of I* G L* belonging to T ext (G, H, U) 
are distributed on a topograph. Although the following theorem for the case of = 2 
has not been described explicitly, it provides a theoretical reason for the parallelogram 
law working appropriately for N = 2. 

Theorem 1. Let (G,H,U) be a type of systematic absence in N — 2, and let L* be 
the reciprocal lattice of L, a lattice consisting of all the translations in G. Then, for 
any primitive vector I* of L* belonging to T ext (G, H,U), there exists 1% € L* such that 
I* ■ I2 = and I*, 1% make a basis of L* . 

In brief, Theorem [1] claims that I* £ T ext (G, H,U) only appear in the gray area in 
Figure [2 regardless of the type (G,H,U). With regard to the case H — L, we shall 
introduce a theoretical proof using topographs in Section 16.11 The case H D L is not 
proved here since it is verified easily by checking lists in |14) . 

By Theorem[TJ a subgraph of a topograph with infinitely many edges is formed by uni- 
fying substructures as in Figure[Hassociated with the lengths \l\\ 2 , 1/2I 2 , |'i + £jl j K* _ '5| 2 
of I*, Zrj € L* \ T ext (G, H, U) that are easily extracted from an average theta series. 

For A^ = 3, similar statements are proved in Theorems [3] and UJ In this case, it is 
necessary to use the following formula instead of the parallelogram law: 

3|Zt| 2 + |Z 1 +2Z*| 2 = 3|Z2| 2 + |2Z 1 +Z2| 2 - (1) 
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Figure 1: Reciprocal lattice vectors that are allowed to correspond to systematic ab- 
sences. 

••• Kf / 

\r 1+ r 2 \ > ■ - < |/;-/;| 2 

Figure 2: Substructure of a topograph corresponding to the parallelogram law. 

As another consequence of Theorems [3] and |4j our algorithm enumerates the Gram 
matrices (I* ■lj)i<i,j<3 for multiple bases ^, £3 of the true solution L* . This makes the 
enumeration procedure robust against missing or false elements in the set of extracted 
lengths, as explained in Section 17731 

For the third result, we provide a method to speed up the enumeration process in 
Section f7.3l In order not to reduce the rate to acquire the true solution, Theorems [3] and 
[4] are used again here. In the test using actual powder diffraction patterns in Section 
18.21 it is demonstrated that the improvement makes the enumeration speed 32-250 times 
faster. As a result, the enumeration process is executed in a few minutes at most. 

The novel algorithm for lattices of rank N — 3 is implemented in the powder auto- 
indexing software Conograph. In section [51 we introduce the default parameters and 
results of Conograph. The default parameters are selected so that less-experienced 
users of the software can obtain good results without modifying them. We prepare 30 
sets of test data, including difficult cases such as samples from Structure Determination 
by Powder Diffractometry Round Robin-2 (SDPDRR-2). Results for rather difficult 
cases are explained in Examples [3HH1 The total time for powder auto-indexing did not 
exceed several minutes. The Conograph software is scheduled to be distributed in the 
near future from |http: / /sourcefo rge.jp/ projects / conograph / 

Notation and symbols 

The notation and symbols used in this paper are summarized in this section. The inner 
product of the Euclidean space is denoted by u ■ v, and the Euclidean norm u ■ u 

i 

is denoted by \u\ 2 . The standard basis *(0, • • • , 0, 1, 0, • • • ,0) of X N is denoted by 
(1 < i < N). 

A lattice L of rank N is a discrete and cocompact subgroup of R . For any lattice 
L, there are linearly independent vectors i>i, • • • , vn G R w over R such that v\, • • • , v/v 
generate L as a Z-module. In this case, ,vn are called a basis of L, and the 

matrix (vi ■ Wj)i<ij<Af is called a Gram matrix of L. The reciprocal lattice L* of L is 
defined as L* :={/*€ R N : I ■ I* € Z for all I € L}. 
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Figure 3: Lattice parameters and the unit cell. 



If {v\, ■ ■ ■ , Vi} C L (1 < i < N) is extended to a basis of L, it is called a primitive 
set of L. In particular, v € L is a primitive vector of L if and only if {v} is a primitive 
set of L. P n (L) is the set consisting of all the primitive sets of L of cardinality n. 

A symmetric matrix (s<y)i<i,j<JV is always identified with a quadratic form Q(xi, • • • , mjv) 
5^ i=1 IZj=i SijXiXj. The linear space consisting of N x TV symmetric matrices with real 
entries is denoted by S N . S^ (resp. Sy ) is the subset of S N consisting of all the 
positive-definite (resp. semidefinite) symmetric matrices. Si,S2 € S N are equivalent if 
and only if there exists g G Gijv(Z) such that 52 = <?«Si *<?. 

For any 5 € 5 , elements of As are called representations of S over Z. 

A s := { l vSv :0^v EZ N }. (2) 

In crystallography, a crystal lattice in three-dimensional Euclidean space is repre- 
sented by a set of lattice parameters a, b, c, a, /3, and 7 as in Figure |3] This parame- 
terization is transformed into a 3 x 3 matrix S := (sy )i<i.j<3 as follows, and S is the 
Gram matrix of the Bravais lattice of the crystal. 

sn=a 2 , s 22 = 6 2 , S33=c 2 , (3) 
S12 = ab COS7, S13 = ac cos/3, S23 = 6c cosa. 

In general, for any lattice L C K , its automorphism group is defined by {g € 
GL(3,Z) : g^'i? = S}, and L is categorized into its Bravais type by the conjugacy 
class of the group in GL(3, Z). In N = 3, it is known that there exist 14 Bravais 
types. In crystallography, selection of the Bravais lattice L2 C L and its basis h,hih 
is standardized according to the Bravais type of L. S in ([3]) is the Gram matrix defined 
for the basis of L^. When L belongs the category called a primitive centring, L = L2, 
and Zi, ?2j Z3 are chosen by the method called the Niggli reduction, which is very similar 
with the Minkowski reduction in TV = 3 (see [2]). When L belongs to the category 
called a face-centered (resp. body-centered) lattice, there exist h € L such that L 
is generated by (1 < i < j < 3) (resp. h+h+h. - k (I < i < 3)), and k ■ l 3 ; = 

(1 < i < j < 3) holds. Regardless of the choice of h,h,h, their Gram matrix S and L2 
generated by these l\, I2, h are determined uniquely. With regard to the Bravais lattice, 
the explanation above is sufficient in order to understand our following discussions. 

In the context of powder structure analysis, representations of S" 1 over Z are called 
q-values of diffraction peaks. 

2 Outline of the powder auto-indexing problem 

A function p on is said to be periodic if L := {I e M. : p(x + 1) = p(x) for any x € 
M. N } is a lattice. We call L the period lattice of p. For example, an electron (or nucleus) 
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density in a crystal (cf. the left figure in Figure S]) is a periodic function of N = 3. The 
following p provides its standard model. 

po) = ^2^2^2pi{x- x ik -i), (4) 

i=i fc=i /ei 

where 

m : number of different elements in a crystal, 
di : number of the ith element in the primitive cell M 3 /L, 
Pi(x) : rapidly decreasing function on R 3 that represents the electron distribution of respective atoms. 

According to diffraction theory, if a crystal has an electron density p, the diffrac- 
tion image of its single-crystal sample equals cf S i ng i e (x* ; p) for some c > 0, where 
f single (x* ; p) is a sum of delta functions given by 

f singled* ;p) := £ \p(l*)\ 2 8(x* - I*), (5) 
p(H := / p(.T)e- 2 ^-'* rfx . (6 ) 

JR N /L 

A powder sample is an ensemble of a very large number of randomly oriented crystal- 
lites. As a result, its diffraction image is proportional to the integration of f s ingle(%*] p) 
on a sphere of radius Jq;. 

f P owd er {q;p) ■= j fsingie(x*;p)dx* = 2y/q ^2 Ip(OI 2 %- in 2 )- (7) 

J\x'\ 2 =q l* eL * 

The p(l*) is called a structure factor in crystallography. 

The right figure in Figure 2] presents an actual powder diffraction pattern. It is 
obtained by replacing every delta function in (J7J with some kind of peak-shape model 
function g that is close to a Gaussian distribution and satisfies J„g(q)dq = 1. 




Figure 4: Powder diffraction pattern. 

Ab-initio powder crystal structure determination retrieves the electron density p 
from the powder diffraction pattern under the assumption that the following additional 
information is available. 

• chemical formula (i.e., the ratio [d\ : ■ ■ ■ : d m ]), 

• density of a single crystal, 
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• rapidly decreasing functions pi (1 < i < m). 

Powder auto-indexing is the initial stage of ab-initio powder crystal structure de- 
termination, and aims to find the period lattice L of p. As the positions of the delta 
functions, elements of A p are extracted from a powder diffraction pattern f p0 wder(q] p)- 

A p := {|r| 2 :l* el', F p (q)^0}. (8) 

L is normally determined from elements of A^ in powder auto-indexing. After L is 
obtained, using the coefficients F p (q), powder crystal structure determination is carried 
out. 

F p {q) ■= £ W*)\\ (9) 



3 Formulation of the powder auto-indexing problem 

In this section, we formulate the problem explicitly. In powder auto-indexing, extracted 
A obs is different from the set A p of true |/*| (I* G L*) owing to observational problems. 
Consequently, the Gram matrix S of the period lattice L of p must be retrieved from 
A o6s under the following assumptions. 

(Al) The observed range of a powder diffraction pattern is contained in a finite interval 
[q m in,q m ax] C (0, oo). Consequently, only information about A p n [q m i n ,Qmax] is 
available. 

(A2) Every q obs e A obs has some observation error. It may be assumed that the thresh- 
old Err[<f bs ] on the maximum error is given. (A method to compute Err[g ] is 
introduced in |23j.) 

(A3) Owing to mistakes and errors in acquisition of the peak-positions (i.e., peak- 
search), A obs differs from the true A p n [q m vmqmax\- There exist small e\ > and 
£2 > satisfying the following conditions: 

(i) For arbitrarily fixed q G A p n [q m in, <lmax]i we have q <£ \J q0b , eAobs [q obs - 
Err[<f bs ],<f bs + Eri[q obs }} with probability e x . 

(ii) For arbitrarily fixed q obs G A obs , we have A p n[q obs -Eri[q obs ], q obs +En[q obs }} = 
with probability e^- 

(A4) When the distribution p is constrained by group symmetry, infinitely many p(l*) 
and F p (q) become zero owing to systematic absences (Section 2]) . 

As assumed in (J3|), A f, s extracted from a powder diffraction pattern has missing or 
false elements. These are caused by observational problems including background noise 
or false peaks due to sample impurity. 

With regard to (@|, sometimes F p (q) — holds due to a special arrangement of atom 
positions Xik, rather than systematic absences (cf. [12] , [2"9"]V However, the probability 
is zero if every Xik is distributed uniformly in R N / L. (The only known exceptions are 
systematic absences.) 

For the special arrangement with zero probability, we replace A p by A ext (p) and 
consider (^2) instead of (42J : 

A ext (p) ■= K u f |r| 2 : r \F'% {q) ^° ° w ! ng k reasons ) • (io) 

8 y other than systematic absences J 
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(A3) Assume that 

(i) For any q € A ext (p) n [q m in,q ma x], q t \J q o b . eA o>. [q obs ~ Err[g o6s ], q° bs + 
Err[g ]] occurs with probability ei. 

(ii) For any q obs G A obs , A earf (p) n [? of,s - Err[<f bs ], q obs + Err[q obs ]] = occurs 
with probability ti- 

Under the conditions (y^T|)— (^J?J) , infinitely many solutions may exist in some cases 
(for example, consider the case of very small [q m in,qmax])- Hence, additional assump- 
tions are necessary in order to guarantee a finite number of solutions. 

(A5) Owing to repulsive force between atoms, we may assume min{|Z| 2 : 7^ / 6 L} > d 2 
for a positive constant d ~ 2A. Then, by the inequalities on successive minima 
of L and its reciprocal lattice L* proved by Lagarias et at 18], in N = 2,3, 
the maximum diagonal entry Djq of a Minkowski- reduced(defined in Appendix [Cj) 
Gram matrix S of L* satisfies 

D N < ^max{ 7 , : 1 < % < N}, (11) 

where 7, is the Hermite constant: 

7l := sup {min{ %Sv : ^ v £ Z N } : S <= 5!_ , detS = l} . (12) 

In particular, D 2 < §d~ 2 and D 3 < 3 • 2~ 1 / 3 d~ 2 follow from 71 = 1, 7I = |, 
73 = 2. (In N = 2, the estimation is improved up to D2 < f d -2 easily.) 

(A6) The length of the interval ^Jq m ax — \Jqmin is sufficiently greater than y/D^ so that 
A obs includes |ZJ ± Z^| 2 , |JJ ± Z^ 2 , |±ZJ + + /J| 2 , and at least one of: 

(i) l^il 2 , 

(ii) \ll\ 2 ,\l* 1+ 2l*\ 2 , 

(iii) \l*\ 2 ,\2l* 1+ l*\ 2 . 

for both i = 1,2, where l\, 1%, 1% is some basis of L*. 

In (A^j, 2 A is selected as the minimum distance between the two closest latttice 
points of L satisfied by any existing crystals. (A^ looks rather artificial. This assump- 
tion is necessary for our algorithm in Table [5j By Theorems [3] and |4j holds except 
for events with zero probability if a sufficiently large q m ax is chosen, regardless of the 
type of systematic absences. Under assumptions (A^5§ and , the number of solutions 
is always finite, because A obs C [q m in, qmax] contains only finite elements and the Gram 
matrix of L* is computed from a combination of elements of A obs due to assumption 

(49. 

Here, it is still non-trivial how to select q ma x- At least, it is clear q max > D3 is 
required; otherwise A ext (p) fl [q m in,qmax] contain only |Z*| 2 of I* e L2, some sublattice 
of L of rank less than 3 in some cases. On the other hand, as seen in ([3]) in Section[51 too 
many q- values are frequently extracted from the interval [q m in, D3] when we set d = 2 A, 
hence D 3 := 3-2 — 1 / 3 cZ — 2 , nevertheless powder auto-indexing is very frequently successful 
even with a smaller interval. Since the time of our enumeration algorithm is roughly 
proportional to the fourth power of the number of elements of A obs (cf. Section [773^1 . the 
time can be significantly decreased by minimizing the range [q m in,qmax]- Considering 
the current accuracy of diffractometers and the power of personal computers, q m ax 
should be chosen empirically to some degree, in addition to the theoretical estimation 
above. See ([3]) in Section l8Tl for a more detailed approach to this issue. 
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4 Summary of crystallographic groups and system- 
atic absences 

We now give some definitions for crystallographic groups and systematic absences. In 
the following, we represent the elements x £ M. N /L as a row vector and I* £ L* as a 
column vector. Furthermore, any group action on M. N /L (resp. L*) is represented as a 
right action (resp. left action). 

Any congruent transformation of the Euclidean space M. N is represented as a compo- 
sition of the orthogonal group O(N) and a translation; if a is a congruent transformation 
of K w , there exist r £ 0{N) and v £ such that: 

/^I'+i/foranyiel*. (13) 

Such a (J is denoted by {r|f}. The group consisting of all congruent transformations of 
R N is the semidirect group O(N) x 1*. By expanding the composition ^x^ 1 ^ 1 ^)^ 2 ^ 2 ^ 
it is seen that the group multiplication is given by: 

{riWi} ■ {t 2 |j/ 2 } = {tit 2 K 2 + v 2 }- (14) 

Definition 4.1. A crystallographic group is a discrete and cocompact subgroup ofO(N)x 
R N . 

A crystallographic group is also called a wallpaper group in N — 2, and a space group 
in N = 3. 

For a crystallographic group G, two groups Rq and L are defined by: 

R G ■- { T <= O(JV) : {i» for some 1/ <= I*}, (15) 
L := {v £ R N : £ G}, (16) 

where ljv is the identity of O(N). Rq is called a point group of G. G is represented 
as the semidirect G ~ i?c K From the definition of L, for any {r|^ T } £ G, the 
class ^ r + L £ R N I L is uniquely determined. Furthermore, the map Rq — > R N / L : 
t 1— > j/ r + L is a 1-cocycle, i.e., it satisfies 

2/ CTT = v J + z/ T mod L. (17) 

The following was proved by Bieberbach. 

Theorem 2 (Bieberbach [2], [3]). In eac/i N, 

(1) L is a lattice and Rq is a finite subgroup of 0{N) consisting of r that maps any 
elements of L to L. 

(2) There are only finitely many isomorphism classes of crystallographic groups. 

(3) A homomorphism if : G± — > G2 is an isomorphism between two crystallographic 
groups Gi and G2 if and only if there is an affine map a of M. N such that <p(g) = 
agar 1 . 

We now proceed to the definition of systematic absences. Let G = Rg X L be a 
crystallographic group, and consider the following periodic function p: 

m 

^) = EE^-<)- ( 18 ) 

i=i oeG 

Note that the density model function p in Q, is represented as in (|T5|) for some space 
group G. 
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Let L 2 (M. N ) be the L 2 -space, i.e., the set of all measurable functions / on M. N with 
a finite L 2 -norm H/H2 := (Jrn 1/0*01 dx) 1 ^ 2 < 00. In order to compute the same list as 
[14], let us assume that 

(Isotropy condition) pi belongs to L 2 (WL N ) R ° , i.e., Pi(x T ) = Pi(x) holds for any 
r G Re- 
Then, p{x") = p(x) holds for any a £ G, and p has the Fourier coefficient 

= EE / Pi(x-x!)e- 2 ^*- l *dx 

i=l crgG /"^ 
m 

= EA(**) E e- 2 ^^^, (19) 

<=1 creG/L 

where p,-(Z*) := J KJV p i (aOe- 27rv ' =Tx ' r cfe. 

Hence, the probability of p(Z*) = depends on the size of Wg,i* C V := M. N /L. 

W G ,l> :={xeV: J2aeG/L e" 2 ^^ 4 * = o} . (20) 

Definition 4.2. For any subgroup of L C H C G, let V H be the subset ofV consisting 
of all the fixed points of H , and U C V H be a connected component of V H . Under this 
notation, T ext (G) and T : ex tiG , H ,U) are defined by 

T ext iG) := {I* G L* : V = W G ,j.}, (21) 
T ext {G,H,U) := {I* EL* :U CW G}1 ,}. (22) 

According to the terminology of crystallography, we say that I* G L* corresponds to 
systematic absences at general positions (resp. special positions) if and only if I* belongs 
toT ext (G) (resp. T ext (G, H,U)). 

Remark 1. An interpretation of systematic absences is provided by the Laplacian op- 
erator on V; the Laplacian operator A := 5Z i=1 on V :— R N / L acts on C°°(V), 
the set of smooth functions (i.e., derivatives of all orders are continuous) on V. A also 
acts on 

C°°{V) G := {/ G C°°(V) : f(x) = f(x' 7 ) for any x G V and a G G}. (23) 

Recall that Al* :— {\l*\ 2 : 7^ I* G L*} coincides with the set of non-zero eigenvalues of 
A for the action. (This is seen from the fact that any f{x) G C°°(V) is represented as a 

Fourier series J2i*eL* Q'e 2 "^ 1 ' 1 , and we have Af(x) = J2i'£L' \l*\ 2 ci*e 27r "^~^ x ' 1 .) 
Similarly, A ext (G) is the set of non-zero eigenvalues for the action of A on C°°(V) G , be- 
cause f G C°°(V) G is represented asJ2[i*] eRG \ L * C U'] E ctSG /l e 271 ^^' 1 " . A ext (G, H, U) 
is the subset of A ex t(G) corresponding to an eigenvector f G C°°(V) G that is not con- 
stantly zero on U . 

In the following, we shall focus on T ex t(G, H, U), because T ext {G) = T ex t(G,L,V). 
From the definition, it is clear that — I* and tI* belong to T ext (G, H, U) for any r G Rg 
if and only if I* does. 

Let Rr be the image of H by the map G — > Rg ■ {t|^ t } (->• t. If the image of 
V by v Y^reR yT 1S denoted by V^ r h, V^ r h is open and closed in V Rh . Hence 
the connected component including x G V H is represented as x + V r h . Therefore, we 
may denote (|22j) by 

r ext (G,H,x + V* : *») := {I* &L* :x + V^*h c%.}. (24) 
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When Hj C G is the stabilizer subgroup of Xi € it is not difficult to confirm that 
the following equivalence condition holds for p in (|18[) satisfying the isotropy condition. 

p(l*) — holds constantly when(a;i, • • • , x m ,pi, ■ ■ ■ ,p m ) 
is perturbed in an open neighborhood in V Hl x • ■ ■ x V Hm x (L 2 (R N ) Ra ) m 

TCI 

I* e f| r eKt (G, fli,^ + f Er « 4 ). (25) 

i=l 

The following Propositions and Corollaries summarize basic properties of systematic 
absences used in the subsequent sections. Proposition 14. 1 1 states a well-known fact: 

Proposition 4.1. All types of systematic absences are classified by triplets (G,H,x + 
V^Hfr), where G, H, and x range respectively in 

(a) isomorphism classes of a crystallographic group G = Rq x L, 

(b) subgroups L C H C G, 

(c) representatives of V Rh /V^ r h . 

As a result, there are only finitely many types of systematic absences for each N > 0. 

Proof. When x € V H is fixed, the map y H ► x + y + V r h gives a one-to-one cor- 
respondence between V Rh /V^ r h and connected components of V H . Therefore, sys- 
tematic absences are classified by the above three terms. The finite cardinality of (jgj) 
is obtained by the Bieberbach theorem, and (0 is straightforward because [G : L] < 
oo holds. In order to prove (jej) is finite, we decompose V R " into a disjoint union 
\J yeV H H iy£.R H y+V^~' R H . Then V Rh is closed in V, therefore compact. Since y+V^-' R H 

is open, the number of representatives of V Rh /V^ r h is required to be finite. □ 
r e xt(G, H, x + V^-' r h ) is computed using the following proposition. 

Proposition 4.2. For fixed I* € L* , the equivalence relation among the right cosets 
Rh\Rg is defined by: 

R h tx~Rht 2 ^ t1 * = t1 *- ( 26 ) 

Then, for fixed x € V H , I* € L* belongs to T ext (G, H, x + V R x) if and only if the 
following holds: 

e 2wV=T^ r ^ } -l' = for £very Rgn g Rh \ r (2?) 

Rht 2 ~Rhti 

Proof. From the following condition is equivalent to I* e T ext {G, H,x + V^ r h): 
e 2TV=i({w+sxy+u T )-r = Q for any Sx g y E RH _ ( 28 ) 

Furthermore, 

Sx Tl ■ I* = Sx T2 ■ I* for any Sx € V" E ^ <^> ^ ~ ' l * = for an y 

re Ah 

X! T ( T i l * ~ T ? 1 *) = Rhti~R h t 2 . 

t£Rb 
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Hence, (1251) holds if and only if, for any Sx £ V^ r h , we have: 

J2 gW^Ife'ir £ e W=I(^M* =fli (30) 

[Tl]e(flff\K)/~ RhT^RhT! 

which leads to the statement. □ 

Corollary 4.1. Let M 6e £/ie order of Rg, and Hg,h C I 1 *' &e ifte following union of 
finite linear subspaces of dimension less than N : 

U G , H := (J KeR":^^!-^*^}. (31) 

TTiere t/ien exists fi C L* /ML* such that I* £ r eKt (G, iJ, x + V^ h h ) <^=^ l*+ML* £ 
/io/ds /or arty 1* £ L* \ Hg,h- 

Proof. We may assume x = from the translation » 4 t; - i in y. Then, ^ T = 
for any t £ H follows from a; 6 Hence, [v T ] £ H 1 (G, V) are mapped to by 

ff^G, V) X ^4'" i/^G, V"), where m is the order of R H (cf. 28J, Proposition 6 of Chap. 
VII). Thus, any r £ i?G satisfies ^ T = y—y T +fi T for some y £ V and /i T € (M/m)~ 1 L/L. 
Since = ^y- ^-y r for any re%we have ^-y € T/ flff , and therefore V_ B ^j/ 7 = 

My £ V^ r h . Consequently, y = ^2 aeR u a + £ for some u € V and £ G M~ 1 L/L. 
Hence, ^ T = £ ff6fllf ^ - E*eR H U ° T + *n where A T = /x T + £ - T e M^L/L. By 
Proposition IHH I* £ L* \ Hg,h belongs to T ext (G,H,x + V I2r h) if and only if the 
following holds for any -R#ti G Rh\Rg'- 



Exeijjj T < T i - T 2)=o E t£ b h t ( t i- t 2)=o 

Owing to M A T e L, whether I* £ L* \ Hg,h belongs to r ext (G, if, x + V^ r h ) depends 
only on the class I* + ML*. □ 

There exists a simple condition equivalent to I* £ T ext (G): 

Corollary 4.2. For any given crystallographic group G, 

I* £ L* belongs to T ext (G) 3t £ Rg such that rl* — 1* and v T ■ I* (f. Z. (33) 

Proof. Let Rg,i* C Rg be the stabilizer subgroup of /*. Then, R^t^RlTi if and only 
if T2 £ t\Rg,i*. Hence, 

R l t 2 ~Rlti 



The statement follows from the fact that r H> e^vi^-z j s a homomorphism on i?c 



□ 
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Now that we have defined systematic absences, it is possible to formulate A ext (p) 
in (fTU)) precisely; for any triplet (G, H,x + l/^ n ff ) with G = Rg x L and x G V H , we 
define 

Ae^cff^ + v^H) : = {|r| 2 :o^rer\r rat (G,ff,x + y E %)}, (35) 

A ext (G) := A ext (G, L, V). (36) 
From the equivalence condition ([25]) . for p in (fl8|) . 

m 

A ext (p) := UA ert (G ) fl i ,i i + V E,, «*). (37) 

2 = 1 

As a result, it is only necessary to consider the case of A ext (p) — A ext (G, H,x + V R n ) 
for some so as to discuss the problem (A^ of systematic absences. 

5 C-type domains and Conway's topographs 

The reduction theory deals with the problem of specifying a domain D C Sy satisfying: 

(Rl) When we put D[g] := {gS*g : S e D} for any subset D C S* and g G GL N {1), 
the subgroup of GLn(Z) consisting of all g G GLn(Z) satisfying D = D[g] has 
only finite elements. 

(R2) For any g G GLat(Z), I? and £%] do not share interior points, except for g G iJ. 
(i?3) 5^, is decomposed as follows: 

S? = (J ( 38 ) 

gHeGL N (Z)/H 

In [5J, a topograph was defined from the Selling reduction with N = 2. So we shall 
recall the Selling reduction first; let An :— {aij)i<i,j<N G <S>_o be the Gram matrix of 
the root lattice A at having entries as follows: 

«y = . . (39) 

I 1 otherwise. 

Using An and the inner-product (S,T) := Trace(ST) = 53i=i 2^=1 s ij£ij on ^> W > 
the domain C Sy is defined by 

Vl el :={SeS? : (S, A N ) < (gS'g, A N ) for any g G Gi w (Z)} . (40) 

If the subgroup of all g G GLn(Z) satisfying l gA^g = A^v is denoted by H(An), 
Sy is partitioned using 2>g ei by the Selling reduction [27] : 

S& = (J ( 41 ) 

g eGL N (Z)/H(A N ) 

In order to generalize the definition of topographs for general N, we'd like to note 
that the tessellation of (|4"Tj) coincides with the one given by Voronoi's two reduction 
theories [32], [33] in N — 2,3. Hence T>g el (N = 2,3) is same as the principal domain 
of the first type defined in the two reduction theories. In the first reduction theory, 
the principal domain is defined as the convex cone expanded by v l v of all the minimal 
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vectors v of An- In the second reduction theory, the same domain is represented as 
follows: 

V($) := {S G S^ : l vSu < b uSu for any v G $ and it G Z^}, (42) 

< := {±Ef=i^e fc :i fe = 0,l}. (43) 

Different from T>g el , in the tessellation of Sy given by the first reduction theory, 
every domain is associated explicitly with the set of integral vectors formed by minimal 
vectors of a perfect form. Even in the second reduction theory, such an association is 
provided by using C-type domains (a union of finite L-type domains), instead of L-type 
domains. 

In the following, we choose the association by the second reduction theory, because 
of Proposition 15.11 on the association of facets of primitive C-type domains and the 
parallelogram law. 

5.1 Topographs for lattices of general rank 

In this section, C-type domains and topographs are defined for the general dimension 
N. We also refer to [23] for more detailed information about C-type domains and 
its connection with covering problems. In [6 , a topograph was defined to explain the 
Selling reduction with N = 2. In contrast, vonorms and conorms were used for the case 
of N > 2. 

The idea of vonorms and conorms as invariants of a lattice seems to have originated 
from the Voronoi vectors defined in Voronoi's second reduction theory; for a fixed S G 
Sy , if v G Z N satisfies S)Sv = vos(v + 2Z ), v is called a Voronoi vector of S. The 
vonorm map of S is defined as a map from v + 2Z N G Z N /2Z N to the representation 
by the Voronoi vector corresponding to v + 2Z N . 

vo s (v + 2Z N ) := min{ l wSw : w G v + 2Z N }. (44) 

Conorms are the Fourier transform of vonorms: when \ is any character on *Z N /2Z N , 
the conorm map of S is defined by: 

cos(x):=-^r £ mo s (v + 21 n )x{v). (45) 

i>+2Z N eZ N /2Z N 

In the following, we use vonorm maps in the definition of C-type domains for clarity. 
Before introducing C-type domains, we shall recall the definition of L-type domains 
(also called secondary cones cf. [30]); The Dirichlet-Voronoi polytope of S is defined by: 

DV(5) := {lei": l xSx < \x + l)S(x + I) for any I G Z N }. (46) 

From the definition, DV(S') is the intersection of half-spaces: 

DV(S') = p| {x €R n : 'xSx < \x + v)S{x + v)}. (47) 

As proved in [33] (cf. [7]), v G Z N is a Voronoi vector if and only if the hyperplane 
H s ,v := {x G K w : *xSx = \x + v)S(x + v)} intersects DV(S). 
A tiling of R N is given by the Dirichlct Voronoi polytopcs: 

R N = [J (DV(5) + 0- (48) 
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The Delone subdivision is a dual tiling of (|4"5)l . If we let Ps be the set of extreme 
points of DV(5), and denote the set of all Voronoi vectors v satisfying p 6 Hsv by ^ p for 
any p £ Ps, then every v £ ^ p satisfies l pSp = \p + v)S(p + v). Therefore, an ellipsoid 
{x £ R N : \x+p)S(x+p) = l pSp] passes through all the elements of {0} U * p C Z N . 
If L p is the convex hull of {0} U ^ p , then the Delone subdivision of R N is given by: 

Del(5):= |J L p + l. (49) 

P eP s ,iez N 

In this L-type domain containing S is defined by {S2 £ S N : Del(.S') = 

Del(S 2 )}. 

For any subset D C S^ and g £ GL N (Z), define D[g] := {gS f g : S £ D}. Two 
domains D\ , D2 C Sy are said to be equivalent if and only if D\ [g] = D2 holds for some 
g £ GLn(Wi). Voronoi proved that the number of equivalence classes of L-type domains 
of dimension [ s finite, and provided an algorithm to gain all the equivalence 

classes [53] ■ This is the outline of Voronoi's second reduction theory. 

For fixed S £ S^ Q , define $5 := {[v] : v £ Z N is a Voronoi vector of S}, where 
[v] represents the class of v £ 7L N when v and — v are identified. A C-type domain 
containing S is defined by V($) := {S2 £ Sy Q : $s = &s 2 }i i- e -, a set of elements of 
having the same Voronoi vectors. Then, V($) is a union of finite L-type domains, 
because a set of Voronoi vectors of S is decomposed into \J pePs ^ P in different ways, 
depending on S. From the definition, the Voronoi map is a linear function on V(<&s) for 
any S £ Sy . 

More generally, we shall define C-type domains for any set $ := {[i>i], • • • , [v n ]} of 
arbitrary size n with v± , ■ ■ ■ ,v n £ Z N : 

V($) := {S £ S^ : %Sv = vo s (v + 2Z N ) for any [v] £ $}. (50) 

This is well defined, because both h)Sv and v + 2Z are invariant if v is replaced by 
—v. From the definition, V(<f>) is an intersection of the following half-spaces. 

v(*) = n n h -°(u,v), (51) 

He* u+21 N =v+2Z N 

H^°(u,v) := {S£S* : t uSu> t vSv}. (52) 

Any S £ Sy is contained in V($>s)- S is said to be in a general position if <I>s has 
exactly 2 N elements. If S is in a general position, V(<I>s) includes an open neighbor of 
S. Such C-type domains are called primitive. Otherwise, there exist [v] ^ [u] £ Z N 
such that u + TL N = v + 2Z N , and S belongs to the following hyperplanes: 

H(u, v) := {S £ S N : "uSu = l vSv}. (53) 

Even for such S, by perturbing the entries of S, S in a general position satisfying 
C $5 is obtained. As a result, the following partitioning of Sy is obtained. 

S?o = IJ V < ( 54 ) 

VEPn 

P N := {V($ s ) : S £ S* , V($ s ) is primitive} . (55) 

This tessellation is coarser than that given by L-type domains. Hence, similarly 
with L-type domains, the number of equivalence classes of C-type domains of dimension 



is finite. 
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Table 1: Equivalence classes of primitive C-type domains 



N = 1 $J 



Voronoi vectors 

{[0U*i]} 



Primitive C-type domain 



°>o- 



N 
N 



$2 



{[0],[e 1 ],[e 2 ],[e 1 +e 2 ]} 

{ J2k=i *fe e fe : «fc = or l| 



v(*8)=ni<i<j<3 



^°(ei-ej,ei +ej) 



N = 4 



$f := 



{ [eLi *fc e *] : »* = ° r i} v (*o) = rii<i<i<B ff -°( e i - e i> e i + e i) 

$4 u {[ei - e 2 ]} \ {[e x + e 2 ]} V($f) = (ei + e 2 , ei - e 2 ) 



$ 2 



= ^U{[ei-e 2 ],[e 3 -e 4 ]} 
\{[ei +e 2 ], [e 3 +e 4 ]} 



n (n 3 < 4 <,<5^-° (ei-e i)ei + ei ) 

V($f ) = ( ei + e 2 , ei - e 2 ) n (e 3 + e 4 , e 3 - e 4 ) 

n ( n fc =3,4 ( e i + e 2 + 2e fc , ei - e 2 ) 
n (rife = i, 2 H-° (ea + e 4 + 2e fc , e 3 - e 4 ) 

nfnt=x- H "-°( e i- e 6» e < + e B) 

n(n i=i-2,fc=3,4 if- (ei + e 5 + 2efc, 

\ or t=3,4,fc=l,2 



e 5 ) 



a Here, we put ejv+i := — J"^— i e$ in every TV-dimensional case. 

'Among the above domains, only V(<1?2) ' s n °t an ^-type domain, but is a union of two L-type 
domains. L-type domains for TV = 4 are listed in ,3(3 . 



Table Q] lists all the representatives of equivalence classes of C-type domains for 
1 < N < 4. Each domain V($*) in Tabled] has a set of extreme rays provided by: 



M($§) := : w = e t (1 < i < N),e t - : (1 < i < j < N)} , 

M($t) := (M($ u )U{ J D 4 })\{w t «:« = e 1 -e 2 } ) 

M($£) := (Af($^)u{ t ; t t ;:w = e 1 +e 2 -e 3 -e 4 }u{D 4 , J D 4!2 })\{w* U : U 
where D 4 and -D 42 are the equivalent perfect forms corresponding to the root lattice 



(56) 
(57) 

i - e 2 ,e 3 - e 4 } , (58) 



D 4 := 



(2 1-1 -1\ 

1 2-1-1 

-1-12 

\-l -10 2 / 



,04,2 := 



/ 2 0-1 

2-1 

-1 -1 2 

V-l -1 1 



-1\ 

-1 

1 

2 / 



(59) 



The following proposition claims that every facet of a primitive C-type domain is 
associated with a set of four vectors satisfying the parallelogram law. Although this is 
the most important property in our discussion, we could not find references mentioning 
this explicitly. 

Proposition 5.1. Suppose that two primitive C-type domains V($s x ) ^ V($s 2 ) have 
^ jv(jv+i) _ \^.^i mens i ona i cone as their intersection. Then $s x n $s 2 contains exactly 
2 N — 1 elements. Hence, there exist u,v € Z N such that $>Si\$s 2 — {[ u )} an d &s 2 \&S! = 
{[«]}. V($s 1 U $52) C H(u,v) is a common facet o/V( < T J 5 1 ) and V($s 2 )- Furthermore, 
{"±3^ 3tr«j. j s a primitive set of Z N , and [^], [^] are elements of $ Sl fl $s 2 - 
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'uS u 

v(o 1 ) o O V(* 2 ) 

'vS v 

Figure 5: Representations associated with an edge of a topograph. 

Proof. From V($ Sl ) ^ V($s 2 ), there exist u^vi such that Ui + 2Z N = v x + 2Z N , 
[ui] G \ $s 2 , and [vi] G $s 2 \ ®Si- Since ui, «i are Voronoi vectors of any S3 G 



V(*si) n V($s a ), we have *uiS 3 Z < HSl and h^Sal < HSL Hence, ^±S. and £ 



2 2 

are also Voronoi vectors of S3. Replacing Wj with gui, gvi (g G GLjv(Z)) if necessary, 
we may assume Ul ~ Vl = ei and Ul + Vl = mei + ne2 for some m, n G Z. Then 
and Hi±2i are Voronoi vectors of §2 '■= ( t eiS , 3ej)i<j J < 2 G 5j - Hence S2 belongs 
to V({[ Bl ^"' ],Mi}), which is equivalent to V($q) m Table [TJ As a result, there exists 
g G GL N (Z) such that g( " 1 ~'" 1 ) = e x and ff( " 1 +" 1 ) = e 2 . Hence, it is concluded that 
I «i +m ^ ni-Di 1 j g a p r j m j^i ve se t f Suppose that there exists another (112,1*2) 7^ 
satisfying u 2 + 2Z N = v 2 + 2Z N , [u 2 ] G \ $S 2 , and [t> 2 ] G $s 2 \ $Si- From 
the dimension of V($Si) fl V(<i>s 2 ), the following must hold for any 53 G S N : 

X U -^)Ss( U -^) = <=► t (^)5 3 (^) = 0. (60) 

We may now assume Ul ~ Vl = e % and Hiim = e2 . Then, [i>i]} = {[1*2], [V2]} is 

easily obtained. Therefore, &s 1 \ <&s 2 and $s 2 \ $s x consist of only one element. Now 
the remaining statements follow immediately. □ 



By Proposition 15. 11 it is proved that the decomposition (|54|) is a facet-to- facet tes- 
sellation. Hence, (I5"4")l is also a face-to-face tessellation by a theorem of Gruber and 
Ryshkov [13]. 

Using the ^-partitioning ([M]) . a topograph is defined for general N. 

Definition 5.1. Define CT/v as the graph which has Pn, En as its sets of nodes and 
edges, respectively. 

P N := |V($s) : S G S£ , $ s ™ primitive} , (61) 
E N := {e V ($i),v(* 2 ) : 7^ V($ 2 ) € P N share a facet) , (62) 

where ey(* 1 ),v(* 2 ) connects two nodes V($i), V($2) G Pat- When Sq G 5^ is fixed 

arbitrarily, edges in En are associated with two representations of Sq over Z by the 
map: 

/s (ev(*i),v(* 2 )) : = VuSqu, tySov}, (63) 

where u, v G Z N are taken so that {[u + v]} = $Si\ < i ) S2 anc ^ {i u ~ v ]} — ^s 2 \ ®Si ■ Such 
an edge is represented in Figured Furthermore, the direction of the edge is defined as in 
Figure^ Assuming every edge is oriented by this, we call the pair CTn,s := (CT/v, fs ) 
a topograph of Sq . 



5.2 Topographs for low- dimensional lattices 

In this section, the structures of topographs for lattices of rank N — 2,3 are explained. 
Since the same topic is also discussed in [B] , we mention only basic facts necessary in the 
following sections. In N = 2, 3, the structures are also determined from the partitioning 
(|4"Tj) of the Selling reduction. In particular, the set of nodes is provided by 

Pn := {V% el [g] = V( t g- 1 ^):geGL N (Z,)}. (64) 
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a)'(u + v)S (u + v) b) '(u+v)S (u+v) c) "(u + v)S {u+v) 

<'(u-v)S l) (u-v) >'(u-v)S (ii-v) ='(u-v)S {u-v) 



V(Oi) ' uS o u y(d>,) v(U>,) ' uS o 11 y(a> ? ) V(<D 1 ) ''-''V' v(©i) 



-> — O 



'vS v 'vS v 'vS Q v 



When V(<E»i), V("I > 2) £ -Pjv have a common facet, there exists u,v S such that <J>i \ $2 = 
{±(m + v)}, $2 \ ^1 = _ f )}• In this case, the direction of the edge connecting V(<&i) and V($2) 

is defined as in [6]. 

Figure 6: Direction of edges of a topograph CTjv,s . 

By Voronoi's second reduction theory, every node T>g el [g] is associated with t g~ 1 $Q . 

In the following, a lattice L of rank N, a basis 61, • ■ ■ , bjy of L are fixed, and a matrix 
(61 • • • 6,/v) is denoted by B. In order to clarify t BB is the Gram matrix of (L,B), 
we utilize the following notation, instead of ft BB , CT N t BB . 

/(L,s)(ev(*i),v(* 2 )) : = /*ss(ev(* 1 ),v(* 2 )) = {\ B u\ 2 , \Bv\ 2 }, (65) 
CTat,(£,b) := CT N t BB . (66) 

where u,v £ 2, N are chosen as in Definition 15.11 

Basic properties of CT 2i (l,b) and CT 3i ( iiB ) are explained in the following examples. 

Example 1. Case of CT 2 i LB \. T^% e l ~ ^(^0) * s a polyhedral cone surrounded by the 
three hyperplanes in Table [71 Hence, a node of CT 2 is adjacent to three nodes, as in 
Figure 




/, -/, L> I +/,!+ /. -/. 



;r=^r + i<.r 



(2) 

Let e3 := — ei — e2, and r- , be the 2x2 matrix satisfying 



r^fei = -Bj, rlfej = e*. (67) 



When we put (l\ I2) ■= (61 62) *9 1 and £3 := — Zi — 12 for the fixed basis 61, 62 of L and g £ GL2CZ), every 
node X>f e! [g] = V(*S _1 *o) is an end P oint of three edges e v( ^_i ft aj v(tg -i r m ^ (1 < * < J < 3) that 

are associated with {|ii| 2 , |Zj| 2 }. (In this figure, any edge between two domains labeled |fci| 2 and |fc2| 2 
is considered to be associated with {|fci| 2 , |fc2| 2 }- Such a labeling of domains is achieved by embedding 
the topograph in the upper half plane as in Figure [8] and labeling every domain Del surrounded by 
topograph edges with |(| 2 of the minimal vector I of some S £ i~ 1 {D). This is well-defined, because I 
is the only minimal vector of all S £ l~ 1 (D).) 

Figure 7: Local structure of topograph CT 2 .(L.(b 1 6 2 ))- 
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It can be proved that CT% is a tree as in Figured The tree is embedded in l S^. /IR>o — 
H by mapping a node V( t g _1 $Q) to the perfect form gA^ Xt g, and an edge between 
V(*5^~ 1 $§) and V( 'ff^ 1 ^) ^° ^ e geodesic connecting g\A~^ Xt g\, giAl2 Xt gi. 



\h +2b 




1/2 1 



A one-to-one correspondence between the points of the upper half-plane H := {z £ C : Im(,z) > 0} 
and iS51q/IR>o is given by the map: 



l : x + s/—\y i 



(68) 



Under this identification of H and iS^q/R>o, the action of g £ GZ/2(Z) on the latter set by S >— ► gS t g 
coincides with the action of GL,2(Z) on H by: 



c d ) ' \m if«*-6c=-l, 



(69) 



where z is the complex conjugate of z. 

Figure 8: Embedding of a topograph in the upper half plane. 



Example 2. Case of CT 3 ( L B y T>g el = V($q) is a polyhedral cone surrounded by the 
six hyperplanes in Table [7J Hence, a node of CT3 is adjacent to six nodes, as in Figure 
All the adjacent nodes 0/ V(*<? _1 $o) are given as V(*g~ 1 r^$ u ) (1 < i < j < A), 
where :— — e i an d T if > * s ^ e 3x3 matrix satisfying: 

T-fe l = -e,, r>j'ej = e J; k^i,j iy 3) e fe = e 4 + e fe . (70) 

j4s explained in Figure^ CT3 contains two kinds of circuits of length 3 and 6, which 
correspond to the following fundamental relations of GL3 (Z) : 

^rfirW = ^ € H(A 3 ), (r§K^ 2 = l, (71) 

where i,j,k,m are integers satisfying {i, j,k,m} — {1,2,3,4}, and o~ik.j m S H{A 3 ) is 
i/ie 3x3 matrix satisfying: 

o-ik,j m e p = -e, /or any (p,g) = (i, fc), (j,m), (m, j). (72) 



6 Main results for the distribution rules of systematic 
absences 

We now discuss on the distribution rules of systematic absences on a topograph. Those 
used in our algorithm are described as theorems, whereas other properties important for 
powder auto- indexing algorithms are mentioned as facts. As proved in Proposition ^. 1[ 
there are only a finite number of types of systematic absences, and they are classified 
by a triplet (G, H, x + V^ r h ). 
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When we put (Zi I2 I3) := (61 62 63) *<? 1 and Z4 := — } , for the fixed basis 61,62)63 and 
g S GLs(Z), each node fs ei [<?] = V(*g -1 'I , g) is an end point of six edges associated with {|Zi| 2 , |2j| 2 } 
(1 < i < j < 4). For any k,m} = {1,2,3,4}, two edges associated with {|Z;| 2 , \lj | 2 }, {|ifc| 2 , |Zm| 2 } 
are contained in a circuit of length 3. On the other hand, two edges associated with {|2j| 2 , |'j| 2 }, 
{|'i| 2 , Kfc| 2 } are contained in a circuit of length 6. 

Figure 9: Local structure of a topograph CT 3 ^ L ^ bl b2 b3 yy 

It is not difficult to prove our theorems if T ext (G, H, x + V^~' r h) is contained in 
Hg,H in (|5Tj) . (This does not occur in the case of lattices of rank 2.) Otherwise, too 
many case-by-case considerations are required. Hence, the most difficult part of the 
theorems is confirmed by direct computation, using the International Tables (N = 2) 
and executing a program (N — 3). For confirmation of the case N = 3, we verify that 
the program outputs exactly the same list as the International Tables. 

The most important property of T ext {G,H,x + V^-' r h) is that L* is generated by 
elements of L* \T ext (G, H, x + V^ r h ), under the assumption that L is the period lattice 
of the periodic function p. Furthermore, if x + V R u C V H2 holds for some subgroup 
H c H 2 C G, it is clear that T ext (G, H,x + V E «h ) = r ext (G, H 2 , x + V 12 "^ ) holds. 
Therefore, the following types of systematic absences may be regarded to be invalid, 
and removed from the following consideration. 

1. L* \ T ext (G, H, x + V^-' r h ) is contained in the reciprocal lattice L\ of some L2 2 L. 

2. x + V^ r h c V H2 for some subgroup H C H 2 C G. 

6.1 Cases of rank 2 

According to the International Tables, there are 17 wallpaper groups and 72 types of 
systematic absences, including invalid ones. By direct computation (or by seeing tables 
in [H]), it is verified that T ext (G, H,x + V^ r h ) = T ext (G, L, V) holds in all the valid 
cases. 

In the following, we introduce a short proof of Theorem [T] in the case H = L using 
a topograph. As a result, Theorem [T] follows for general cases. 

Lemma 6.1. Let L* be a lattice of rank 2, and R C O(N) be the automorphism group 
of L* . If tI* = l\ holds for some 1 ^ r € R and primitive vector of I* G L* , either of 
the following holds: 

(a) there exists 1% € L* such that I* ■ l 2 — and l*,l 2 is a basis of L* , or 
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(b) t 2 — 1 and there exists l 2 G L* such that I* = /| + Til% and ni| is a &asis of L* . 

Proof. Fix a Gram matrix So of L* . Let ^ G L* be the vector satisfying {l*,^} G 
P 2 (£*), • l* 2 > and |^| 2 = min{|^| 2 : {^,Zf} G P 2 (i*)}- The edges of CT 2 , So then 
have the direction presented in Figure [TU] The left figure corresponds to the case 




Figure 10: Directions of edges of a topograph associated with |ZJ| . 

In the case of the right figure, r fixes the node surrounded by |Zi| 2 , \h\ 2 , and \h — h\ 2 - 
If we recall that the stabilizer of V($q) is given by H(A3), the action of r or — r on 
{/*, — i|) —11 + I2} coincides with a permutation of l*, — Z|, —I* + Z|. From t^I and 
rl* = l\, we obtain tI% = /* — ^- This is equivalent to the case □ 

Proof of Theorem]!] in the case of H — L. From Corollary 14. 21 for any primitive vector 
I* G T ext (G), there exists r G R with r/J = l\ such that v T -l\ ^ Z. If (JbJ) of Lemma RTT1 
holds, then: 

v r -l\ = v\+ T ■ l* 2 = u T 2 • Z* = ^ • Z* e Z. (73) 
Hence, the statement of the theorem follows. □ 

6.2 Cases of rank 3 

For N — 3, we consider the following cases separately: 

(i) T ext (G,H,x + V T " 1 h ) n Pi(L*) is contained in H Gi h in (EJJ; 

(ii) r ext (G, H, x + V^ r h ) n Px{L*) is not contained in H g ,h- 

Table lists all valid types of systematic absences corresponding to the latter case. 
For each type, il C L* /ML* in Corollary [O] is given in Table [3] 

First, we shall explain some more details of Ito's method, and determine why it does 
not work appropriately for some types of systematic absences. It was Ito [15] who first 
proposed using the parallelogram law for powder auto-indexing; if the observed q- values 
91,92,93,94 G A obs satisfy 2(qi + q 2 ) = q% + (74, the method assumes that there exist 
lt,l 2 G L* satisfying the following: 

9i = l^| 2 , 92 = |^| 2 , 93-|^+^| 2 , q^\ll-l* 2 \ 2 - (74) 

If (f?"4"|) is true, the Gram matrix of the sublattice expanded by l\,l 2 is determined. In 
order to obtain candidate solutions for L*, it is necessary to construct a 3 x 3 Gram 
matrix from combinations of these sublattices. In order to simplify the combination 
procedure, it is very desirable that {l{,l 2 } in (|74| is a primitive set of L* . Otherwise, 
it is necessary to check whether L* has rank 3 sublattices that are more plausible as a 
solution once L* is obtained. The program of Visser [3T] which adopted Ito's method 
also implicitly requires {11, 1%} G P 2 (L*) [TU] , 

However, according to the following fact, in some types of systematic absences, 
{l\J 2 } C L* is never a primitive set of L* , if {I*, l 2 } satisfies ((74)) . 
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Table 2: Types of systematic absences having T ext {G, H, xli+yl2+zls+V^ R H ) (£ %g,h- 



Space group G (No. a ) Rjj (x,y,z) 
Type-A (Face-centered lattice) 



F d d 2 (43) 
F ddd (70) 
F ddd (70) 
F ddd (70) 
F d 3 (203) 
F d 3 (203) 
F d 3 (203) 
F 4i 3 2 (210) 
F4i 3 2 (210) 
F 4i 3 2 (210) 
F dim (227) 
F dim (227) 
F dim (227) 



C 2 

C 2 

7J 2 

D 2 

C 2 

T 

T 

C 2 

T 

T 

C 2 „ 

T d 

2d 



(0,0,2) 

(a, 0,0) 
(0,0,0) 

'-2' 2' 2' 

(a;, 0,0) 
(0, 0, 0) 
(I I 1) 

<. 2 > 2' 2' 

(a:, 0,0) 
(0, 0, 0) 

fl I 1) 

'-2' 2' 2' 

(z,0,0) 
(0, 0, 0) 

(I I II 

>. 2 ' 2 ' 2 I 



Type-A (Body-centered lattice) 



I 4i (80) 
7 4i/a (88) 
I 4i/a (88) 
7 4i/a (88) 
7 4i 2 2 (98) 
7 4i 2 2 (98) 
I 4i 2 2 (98) 
7 4i m d (109) 
/42d (122) 
/42d (122) 
7 4 2 d (122) 
7 4i/a m d (141) 
7 4i/a m d (141) 
I Ax/ am d (141) 
I Ai/amd (141) 
7 4i/a c d (142) 



C 2 
C 2 
S 4 
5 4 
C 2 
D 2 
D 2 
C 2 „ 

c 2 
s 4 
s 4 
c 2 

C 2v 

D 2d 
D 2d 
C 2 



(0,0,2) 

(0,0, z) 
(0,0,0) 
(0,0, i) 

(0,0,2) 

(0,0,0) 

(o,o,i) 

(0,0,2) 
(0,0,2) 

(0,0,0) 
(0,0,1) 
(x,x,0) 

(0,0,2) 

(0,0,0) 
(0,0,1) 



Type-B (Face-centered lattice) 



F ddd (70) 
F ddd (70) 
F d 3 (203) 
F d 3 (203) 
F 4i 3 2 (210) 
F 4i 3 2 (210) 
F dim (227) 
F dim (227) 



Q 

Ci 

C 3 i 

C 3 i 

D 3 

D 3 

D 3d 

D 3d 



Type-B (Body-centered 

7 4i/a (88) C 

I 4i /a (88) C, 

7 4i/o m d (141) C 2h 

7 4i/a m d (141) C 2h 
Type-C 

I 4i/a m d (141) C 2 

/ 4i/o c rf (142) C 2 
Type-D 

P 3 1 c (159) C 3 

P 3 1 c (163) C 3 

P 3 1 c (163) D 3 

P 3 1 c (163) P> 3 

P 6 3 (173) C 3 

P 63/m (176) C 3 

P 63/m (176) C 3/l 

P 63/m (176) C 3h 

P 63 2 2 (182) C 3 

P 63 2 2 (182) D 3 

P 63 2 2 (182) 7J> 3 

P 63 m c (186) C 3v 

P 6 2 c (190) C 3 

P 6 2 c (190) C 3h 

P 6 2 c (190) C 3h 

P 63/m mc (194) C 3 „ 

P 63/m mc (194) D 3h 

P 63/m mc (194) D 3ft 
Type-E 

P 62 (171) C 2 

P 64 (172) C 2 

P 62 2 2 (180) C 2 

P 62 2 2 (180) D 2 

P 62 2 2 (180) 7J> 2 

P 64 2 2 (181) C 2 

P 64 2 2 (181) D 2 

P 64 2 2 (181) P> 2 
Type-F (Primitive) 

P 4 2 3 2 (208) C 2 

P 4 2 3 2 (208) C 2 

P 4 3 n (218) C 2 



lattice) 



- 1) 

4' 8- 1 

2) 

if 

I'I 
4' 8- 1 



1 2 

l'l 
I'I 
l'l 

l'l 
I'I 
1'? 
l'l 
I'i 
l'l 
l'l 
l'l 
l'l 
l'l 
I'i 
l'l 
I'I 

l'l 

3 > 3 



1 1 

l'l 
f 2 
I 

T 



l) 



*) 
*) 

i] 

z) 
z) 

i\ 

z) 

i\ 

i> 
z) 
z) 

% 

z) 



z) 
z) 

,0,2) 

,0,0) 

,0,2) 



I 



,0,0) 

5) 



*,A,0) 
^,0,1) 
x,|,0) 



P 4 3 n (218) 

P m 3 n (223) 

P mln (223) 

P mln (223) 

Type-F (Body- 

7 4 3d (220) 

7 a 3 d (230) 

Type-G 

P 4 2 3 2 (208) 

P 4 2 3 2 (208) 

P 4 3 n (218) 

P 4 3 n (218) 

P mln (223) 

P mln (223) 

Type-H 

P 4 3 3 2 (212) 

P 4 3 3 2 (212) 

P 4i 3 2 (213) 

P 4i 3 2 (213) 

Type-I 

7 4i 3 2 (214) 

7 4i 3 2 (214) 

Type-J 

7 4i 3 2 (214) 

7 4i 3 2 (214) 

7 4 3 d (220) 

7 4 3 d (220) 

Type-K 

7 4i 3 2 (214) 

7 4 3 d (220) 

7 a 3 d (230) 

Type-L 

I a 3 d (230) 

Type-M 

7 a 3 d (230) 

7 a 3 d (230) 

Type-N 

7 a 3 d (230) 



C 2 
C 2 

c 2v 
c 2v 

centered 

C 3 
C 3 

D 2 

D 2 

S4 

S4 

D 2d 

D 2d 

D 3 
D 3 
D 3 
D 3 

D 3 
D 3 

D 2 
D 2 
S 4 
S 4 

C 2 
C 2 
C 2 

C 2 

D 2 
S 4 

d 3 a 



"Number assigned to every space group in 1141 . 

'The fractional coordinate (x, y, 2) indicates the point xl\ + yl 2 + yl 3 mod L 2 £ V (x, y,z £ [0, 1)), 
where L 2 C L is the Bravais lattice, and 'i , ^2 , '3 is the basis of L 2 chosen by the method in "Notation 
and symbols". 



s) 
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A 
B 



Table 3: Necessary and sufficient condition for 53<=i u ^*i 4- T~Lg,h to belong to 

T ext (G,H,x + V^*H). 

Type Necessary and sufficient condition 

face-centered => i*i + 1*2 + 113 = 2 mod 4 
body-centered => 2i*2 + 113 = 2 mod 4 
face-centered => {1*1, 1*2, 1*3} = {0, 0, 2}, {0, 2, 2} mod 4 
body-centered => {m , u 2 , 1*3} = {1, 1, 0}, {3, 3, 0}, {1, 3, 2} mod 4 
C (1*1,-1*2,1*3) = (1, 1,0) mod 2. 
D 111 = U2 mod 3, 113 = 1 mod 2. 
E 1*1,1*2 = mod 2, 1*3 ^ mod 3. 

primitive => {1*1,1*2,1*3} = {1, 1, 1} mod 2 

body-centered => {1*1, ua, "3} = {0, 0, 2}, {2, 2, 2} mod 4 
{0,0,1}, {1,1,1} mod 2, and 
^ {0,1, 2}, {0,2, 3} mod 4. 

{0,0,0},{0, 0, 1} mod 2, and 
^ {0, 1, 2}, {0, 2, 3}, {0, 0, 0}, {2, 2, 2} mod 4. 
{1*1,1*2,1*3} = {0,0,2},{0, 2, 2} mod 4. 

f ^ {0, 0, 0}, {0, 2, 2}, {1, 1, 2}, {1, 2, 3}, {2, 3, 3} mod 4, and 

I ^ {0, 2, 4}, {0, 4, 6}, {0, 1, 1}, {1, 1, 4}, {0, 3, 3}, {3, 3, 4}, {0, 1, 7}, {1, 4, 7}, {0, 3, 5}, {3, 4, 5} mod 8. 
K {1*1,112, 1*3} = {2,2, 2} mod 4. 

L {ui,«2,«3} = {0,1,1}, {0,1, 3}, {0,3, 3}, {2, 2, 2} mod 4. 

{0, 0, 0}, {0, 2, 2}, {1, 1, 2}, {1, 2, 3}, {2, 3, 3} mod 4, and 
I ^ {0,2, 4}, {0,4, 6} mod 8. 
{«i, 1*2,1*3} ^ {0,0, 0}, {1,1, 2}, {1,2, 3}, {2, 3, 3} mod 4. 



G {1*1,1*2,1*3} 

H {mi, 112,5*3} 

I {1*1,1*2,1*3} 

J {1*1,1*2,1*3} 

{1*1, 1*2, 1*3} 
{1*1,1*2,1*3} 

M {1*1,112,1*3} 



a {l* , 1%, l 3 ) is the reciprocal basis of the basis Zi ,l2,l 3 of R 3 taken as in the footnote of Table [2] 



Fact 1. // (G, H, x + V r h ) is a type B or N, the following set is empty: 

{{i$,r 2 } g *ml*) : ilWi + 1*2,1*1- 1*2 i r ext (G, x + . (75) 

To remove adverse effects of systematic absences from Ito's algorithm, it has been 
proposed that a formula other than the parallelogram law should be used [5] : 

\l\ + ml* 2 \ 2 - \l* - ml* 2 \ 2 = m(\ll + l* 2 \ 2 - \l* - l* 2 \ 2 ). (76) 

However, it has not been ascertained whether the equation works appropriately for all 
types of systematic absences. As another example, the following was proposed to obtain 
a rank 3 solution directly. 

\n\ 2 + \i* 2 \ 2 + n\ 2 + \n +1*2+ i*s\ 2 = \n + ^i 2 + \n + ^i 2 + n + ^i 2 - (77) 

The above equation has a similar property to the parallelogram law: 
Fact 2. If (G, H,x+ V z ^ r h ) is of type B, C, F, G, or N, the following set is empty: 

{{i*,q,r 3 } g p 3 (l*) : i*,i;,r 3 ,i* + r 2 ,i*i+i* 3 ,i*2 + 1*3,1*1 + 1*2 + 1*3 £ r ext (G, H, x + • (78) 

The following two theorems are intended to resolve any this kind of problems caused 
by systematic absences. There, it is proved that there exist "infinitely many" elements 
of P 2 {L*) or P 3 {L*) satisfying some common properties, and connected subgraphs of a 
topograph containing infinitely many nodes are formed from the lengths of lattice vectors 
belonging to such elements. Existence of infinitely many elements is required not to fail 
to gain the true solution, because only a finite subset A obs C T ext (G, H,x + V^ R n 
is available in computation. From the latter claim, it is guaranteed that rather large 
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Figure 11: A subgraph of a topograph corresponding to the equation 3 1 Z * | 2 + |^i +2^| 2 = 
\2l* + Zjl + 3 1 I and a chain formed from such subgraphs. 

subgraph is formed for the true solution, even from finite elements belonging to A obs . 
This is useful to select better candidate solutions and reduce computation time. 

We shall explain the meaning of Theorem[3]before describing the statements; in short, 
it claims that the equation 3\ll\ 2 + \l* + 2Z^ | 2 = \2l* + l 2 \ 2 + 3 1 | 2 works appropriately 
regardless of the type of systematic absences. When (L* ,B*) represents the reciprocal 
lattice of the lattice to obtain as a solution and a matrix whose columns are a basis 
of L*, this new equation corresponds to the subgraph of CT 2 ^l',b*) consisting of three 
nodes and two edges in the left-hand of Figure fTTl A subgraph of CT 2 ^ L * ,b*) containing 
infinitely many nodes as the right-hand of Figure [TT] is formed by linking such a subgraph 
obtained from q r ,q t ,q s ,q u G A ext (G, H,x + V^ r h ) satisfying 3q r + q t = 3q s + q u . 

Theorem 3. For any type of systematic absences (G, H,x + V^kh ) with G = R g kL, 
the following subset of P 2 (L*) includes infinitely many elements. 

P 2 {L*) := [{lt,l*2\ G P 2 {L*) : mil + (™ - 1)^2 t T ext {G,H,x + V^ r h) for any m G z| . 

More precisely, for any open convex cone U C M. N satisfying UDHg.h = 0, the following 
holds: 

(1) There exists {^i , ^2 } G P 2 (L*) such that {mil + n ^2 '■ m,n E 1>q} C U. If 
Test&H^ + V^B) c Hg,h, such {11, I*} G P 2 (L*) belongs to P 2 (L*). 

(2) Assume that (G, H,x + V^ r h) is one of the types in Table® and {I*, I*} € Pz{L*) 
satisfy 

(a) {11,11} C U , and 

(b) 11, l 2 , 2ll+r 2 ,ll+2l* 2 , 2l'l+2,l 2 , 'il'l+2V 2! 311+ All ( Le -, all the vectors other than 
11 + 11 in the right-hand of Fiaure \ll\) do not belong to T ext (G, H,x + V r h ). 

Then, {11, l 2 } is an element of P 2 {L*). Furthermore, {I*, l 2 } 6 P 2 (L*) satisfying 
the two properties exists. 

Proof. If P 2 (L*) is not empty, it includes infinitely many elements because, for any 
n E Z, {-nil ~ (n- 1)1*, (n + + nl* 2 } € P 2 (L*) belongs to P 2 (L*) if and only if 
{11, l 2 } & P 2 (L*). Hence, the first statement follows immediately from (fT|) and ©. 

((T|) is almost straightforward. In particular, the existence of {I*, I*.} € P 2 {L*) sat- 
isfying {I*, I*} C U is proved in Lemma loT2"l Thus, ml* + nl* 2 € U holds for any 
integers m,n > 0. Therefore, if T ext (G,H,x + V^ r h) c U g ,h, ±(ml* + nl* 2 ) £ 
T ext {G,H,x + V^ r h ) follows. As a result, we obtain {ll,l* 2 } G P 2 (L*). 
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In order to prove ^j, let M be the order of Rq- We first define the following sets: 



T extM (G,H,x + V^ r h) := {/* + ML* : I* G r(G,ff,z + ^ Er ") \ H G , H } , 

f , , \K, l%\ G P^fL*) such that for any m G Z, 

-P2 m{L ) := < m + ML ,1% + ML } : 11 21 x ^ , v 

Note that T exty M{G, H, x+V^ r h ) is equal to fl in Corollarv l4.ll By direct computation, 
it is verified that P2,m(L*) equals the following set: 

r,* , ,, r * ,, r *-, {'*»^} e P 2 (i*) satisfying for some m = fc, •• • ,fc + 6 (fceZ),l 
|Z 1+ ML ,l 2 +ML }: m ^ + ( m -l)i|^r ext>M (G,if,x + y^) J 

Consequently, if {ZJ,^} S P2{L*) with {Z*,Z|} C £/ satisfy the assumption of ©, 
{Z* mod M,l 2 mod M} G P 2tM (L*), and therefore {Z 1; Z 2 } G P2(L*) is obtained from 
the definition of U. Furthermore, P 2 ,m(L*) 7^ holds for any type of systematic absences 
presented in Table |H Therefore, from Lemma [6.21 there exists {Z 1; Z 2 } G P 2 (L*) such 
that ML*, 11+ ML*} G P 2 ,m(L*) and {ll,l* 2 } C £7. In this case, {l\,l* 2 } G P 2 (£*) 
also holds. □ 

The following lemma is used in the proof of Theorem [3J Although it is straightfor- 
ward, we provide a proof. 

Lemma 6.2. Let L C K 6e a lattice of rank N, and 1 < m < N be an integer. Then, 
any open cone U C contains a primitive set {Zi,--- , Z m } G P m (L). Furthermore, 
when M > is a positive integer and {hi, • • • , k m } G P m {L), U contains infinitely many 
■ ■ ■ , l m } G P m (L) satisfying li — ki G -MX /or any 1 < i < to. 

Proof. We prove the first statement by induction. Since {al : a G Q, I G L} = {aZ : 
a G Q, Z G is dense in R N , there exist a G Q and Z G P\{L) such that aZ G U. 

Hence, Z G Z7 is obtained. Next suppose that m < N and there is T G P m with T C U. 
Then there exists Z G L such that T U {1} G -P m +i- Furthermore, for any I2 G T, there 
is e > such that J7 2 := {a; G l w : (1 — e)|x| 2 |Z 2 | 2 < (x • Z 2 ) 2 } is contained in Z7. Clearly 
there is an integer s > such that Z + sZ 2 G U2. As a result, T U {Z + sZ 2 } G -P n +i is 
a subset of U. In order to prove the second statement, we fix a basis l\, ■ ■ ■ ,In G U 
of L. Then there exists g G GL(Z/MZ) such that Zc^ = <?Zi for any 1 < i < m. When 
the subgroup of 5 G G£(Z) with positive entries is denoted by GL+(Z), the natural 
map GL + (Z) — s- G := {a. G GL (Z/A/Z) : deto. = ±1 mod M} is an epimorphism. 
Let go G GL + (Z) be an element belonging to the inverse image of g mod M. Then 
9oh, ' " j 9qIn are all contained in U and satisfy goh — ki G AfL (1 < i < m). □ 

Theorem 0] claims that a solution of rank 3 is obtained from the combination of 
two subgraphs of a topograph for lattices expanded by I* and I* (i = 2, 3) as in the 
right-hand of Figure HU and \±l* + l? 2 + l* 3 \ 2 G A ext (G, H, x + v' Er h ). By using such 
subgraphs with infinitely many edges for the purpose, the "sort criterion for zones" 
proposed in Section 17.31 is provided a theoretical foundation. 

Theorem 4. Using the same notation as Theorem^ let P^{L*) be the set of {ll,l 2 ,l^} G 
Ps(L*) satisfying (a) and (b) with both £ = 2,3. 

(a) ±l{ + l* 2 + l* 3 i T ext (G, H,x + V^^-h ). 

(b) ml* + (to - 1)(-Z* +1*) i T ext (G,H,x + V^ r h) for any to G Z, or ml* + (m - 
1)(ZJ -I*) i T ext {G,H,x + V^"H) for any to G Z> . 
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Table 4: Density of L* \ T ext {G, H,x + V^ r h ) in L*. 
Type P 2 m{L*) in P 2 , M {L*) P 3 ,m(L*) in P 3 ,m(£*) 



A 


0.321 


0.286 


B 


0.286 


0.143 


C 


0.476 


0.190 


D 


0.341 


0.209 


E 


0.736 


0.604 


F 


0.714 


0.571 


G 


0.214 


0.027 


H 


0.429 


0.058 


I 


0.714 


0.571 


J 


0.071 


0.022 


K 


0.857 


0.786 


L 


0.107 


0.004 


M 


0.036 


0.004 


N 


0.036 


0.004 



a M is the cardinality of Rq. The densities are computed by dividing the cardinality of m(L*) by 
that of Pi m(L*) {i = 2,3), where P; ; m is defined by 

PiM L *) :={(!* + ML*,- ■■ ,1%+MV) : {It,-- - ,l%}EPi(L*)}. (83) 

'The densities of G, H, J, L, M, N, which consist of only primitive and body-centered cubic lattices, 
are rather small. We were not able to find another combination of g-values with larger densities. As a 
practical measure against small densities, the parallelogram law is used in the enumeration procedure 
of Table|6] in addition to 3^ | 2 + |jj + 21* | 2 = 1 2l\ + ^| 2 +3|f|| 2 . (This is effective except for the type 
N, owing to Fact [Tl ) Furthermore, as explained in Section l7.3l condition Jb} in Theorem|4]is relaxed in 
the actual algorithm, and subgraphs with relatively many edges are prioritized. 
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Figure 12: Connected subgraph of a topograph consisting of edges associated with 
{{k, mil + (m- l)(~h + !*)}: to e Z} or {{h,ml* + (m - !)(/* - I*)} : to e Z> }. 



Pa(L*) i/ien contains infinitely many elements, regardless of the type of systematic ab- 
sences. 

Remark 2. This theorem may be considered to refer the vectors associated with the 
edges in the circuit of length 6 in Figure 03' the 6 edges are associated with one of the 
four vectors in the following parallelogram law: 

(a) 2(\h\ 2 + \h + h\ 2 ) = |li + h + + \-h + h + k\ 2 , 

(b) 2(|Z 1 | 2 + |^| 2 ) = |Zi+Z,| 2 + Hi+^| 2 (i = 2 or 3). 



Proof. By Lemma l6.21 any open convex cone U C M. n \Wc,h contains {ZJ, — + — IJ + 
I3} e P*{L*). Hence, U includes ±l\ + 1* 2 + 1%, ml\ + (m - l)(-h + I*) (m € Z) and 
ml* + (to - 1)(Z* - I*) (to € Z> ). Consequently, if T ext {G, H,x + V^ r h ) is contained 
in T-Lg,Hj the statement is obtained immediately. 

If (G, H, x + V^ r h ) is one of the types in Tabled let M be the order of Rq, and 
^ext,Ai{G,H,x + y^ff) and P2.m{L*) be the sets defined in the proof of Theorem[3] 
Furthermore, we define 



r(L*) := 



ffi,i 2 *,ii}6P 3 (L*), 

{«! + JW-L ,l 2 +ML ,l 3 + ML ^ + _^ _ ^ + Qr {J . + MJ . % _ n _ n + 

belongs to P 2 ,m(L*) (2 < i < 3) 



By direct calculation, it is verified that P^,m(L*) 7^ 0, regardless of the type of sys- 
tematic absences (See Tabled]). From Lemma RT21 there exist infinitely many {/*, — I* + 
1*2,-1*1+1*3} e P 3 {L*) such that {l{+ML*,-l{+l%+ML*,-l{ + l% + ML*} £ P 3 ,m(L*) 
and {11, -It + 12,-11 + 11} C U. In this case, {IJ,I|,Z£} € P 3 (i*) also holds. □ 



7 Algorithm for powder auto-indexing 

In order to elaborate our new algorithm, we first define a data structure that is assumed 
to be implemented in the program. The set A obs of observed lattice vector lengths is 
input as an array ((g,[0],qj[l]) : 1 < i < M) consisting of pairs of a q- value qi[0] and its 
estimated error qi[l] = Err[qj[0]]. In our algorithm, every candidate for a Gram matrix 
of L* is provided as a matrix with entries n iQi[Q] having coefficients rii G iZ. At 

various stages of powder auto-indexing, the propagated errors of the entries are useful 
for making statistical judgments and strengthening the algorithm against observation 
errors in the g-values. For this purpose, a data structure of a formal sum of elements 
of A obs with coefficients n, € 4]\fZ is implemented in Conograph (N is a fixed positive 
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Table 5: Procedures for enumeration of four q- values satisfying the parallelogram law. 

void enumerateItoSolutions(A o6s , c, Ans) 
(Input) A obs : array of N peak pairs of a q- value qi[0] and its approximated error qi[l]. 

c > : parameter setting error tolerance level. 
(Output) Ans : array of a sequence ({q r , q s }, {qt, Qu})> where q r , q s , qt,q u are elements of A obs sat- 
isfying 

!|Val(2q r + 2q s — q t — q u )\ < c min{2Err(q,. + q s ), Err(g t + q u )} (Parallelogram law), 
( VqM - VqM) 2 < qt [0] , q u [0] < ( ^/q7\0] + VSl) ~ (Positive-definite condition) . 



1: (Start) Set a sorted sequence S :— + qj : 1 < i < j < N peak ) of formal sums. 

2: for % := 1 to jN peak {N peak + 1) do 

3: Let 1 < J rnin , J max < N peak be integers satisfying 

4: J mm < j < J max ^ |Val(25[i] - S[j})\ < 2cErr(S[*]). 

5" for J . - — - 'J?nin to Jmax 0-0 

6: if |Val(2S , [i] - S[j])\ < cErr(S[j]) then 

7: {q r ,q s } ■= get Terms (S[i\), 

8: :=getTerms(5[i]). 

9: if (vOTI - V^IOJ) 2 < ft[0],?«[0] < ( V / ^[0l+ V^IOJ) 2 thcn 

10: insert ({g r , g s }, {q t , g u }) in Ans. 

11: end if 

12: end if 

13: end for 

14: end for 



integer). The data structure is equipped with the order < and functions get Terms, Val, 
Err as follows: 



M M M M 



y^atqi <V&;Cfc V a l q t [0) < V hq^O], (85) 

* — ' ^ — ' def * — ' z — * 

i=l i= 1 i= 1 ?—l 

getTerms ^ n t q)j := : 1 < % < M, m ^ 0}, (86) 

(M \ M 

^n i% := £ «*?<[()], (87) 

t=l / i=l 

/ M \ / M \ 1 / 2 

Err := $>?fe[l]) 2 ■ 



In particular, if Val and Err are called with the argument riiqi, they return the 



value and the propagated error of Y2i=i n iQi[Q)> respectively. 



7.1 Algorithm for iV = 2 

In this case, according to Theorem [TJ the method using the parallelogram law works 
sufficiently. 

On output of the procedure in Table[5J each entry ({q r , q s }, {qt, qu}) in Ans satisfies 
the parallelogram law 2(q r + q s ) = q t + q u , and corresponds to the 2x2 positive-definite 
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symmetric matrix: 

(\a a 1 \a f a 1W (, V&1(<Zr) £Val(g t - g r - 

H?r,g.h{9t,?u>j-^^ Val(gt _ ?r _ ga) Val(5s) j. («9) 

Here we used the assumption that there exist l 2 £ L* such that q r = \l*\ , q s = |^| 2 ; 
q t = \l* 1+ l*\ 2 . 

7.2 Algorithm for N = 3 

The theorems in Section 16.21 state how to construct the Gram matrices of lattices of 
rank 3 from elements of A obs satisfying the equation 3|^| 2 + \l* + 2l* 2 f = \2l\ + l 2 \ 2 + 
3 1 ^2 1 2 - Considering the case in which powder diffraction patterns contain only a small 
number of peaks, it is better to also use q- values satisfying the parallelogram law in the 
enumeration algorithm. Hence, the following two computational assumptions are used 
in the algorithm of Table [5] 

(CI) If q r ,q s ,q t ,q u € A obs satisfy 2(q r + q s ) = q t + q u , then there are i*, 2^ G L* such 
that q r = \lt\ 2 ,q s = \l* 2 \\q t = \l{ +l* 2 \\q u - |fj - l* 2 \ 2 . 

(C2) If q r ,q s ,qt,qu G A obs satisfy 3q r + q t = 3q s + q u , then there are G L* such 
that q r = \lt\ 2 ,q s = \l* 2 \ 2 ,qt = \l{ +2l* 2 \ 2 ,q u = |2fJ + l* 2 \ 2 . 

The computation time of the procedure in Table [5] is roughly proportional to N zone , 
where N zone is the size of A 2 immediately after @. This is estimated as follows. The 
size of A3 in ([3]) is approximately ^N zonei if the cases Q\ = Q2 or Q3 = Qa are ignored. 
When Np ea k is the cardinality of A obs , the average number of (i?i, R2, R3, Ra) G A 2 
satisfying (a) or (b) with regard to fixed (Qi, Q2, Q3, Qu) G A 2 is approximated as 
4 jv *°T ' Hence, tne num ber of combinations of (Q\, Q2, Q3, Qa), (Ri, R2, R3, Ra) G R2 
and q k G A obs is roughly equal to 4N zone ■ ■ N peak = 16N 2 one . Steps (l)-(3) take 

much less time than (4). Hence it is concluded that the time is proportional to N zone . 

The enumeration is completed by calling the function in Table [6j after which the 
following procedures are required before outputting solutions. 

1. Transform every enumerated Gram matrix into a Niggli-reduced form (22) (which 
is standard in crystallography). 

2. Bravais lattice determination. 

3. Sort solutions by some figures of merit (e.g., de Wolff figure of merit, widely used 
in powder auto-indexing [TTjV 

4. Remove duplicate solutions. 

7.3 Speed-up method using topographs 

As described in Section 17.21 the computation time of the algorithm in Table [5] is pro- 
portional to the square of N zone in A2 immediately after @. Thus, an effective way to 
speed up the algorithm is to reduce the size of A 2 . 

When reducing the size of A 2 , it is necessary to retain elements obtained from 
q r ,q s ,qt,q u G A obs for which the assumption (CQ} or (CJ2) is true, because they are 
essential to obtain the true solution. A new criterion to sort the elements of A 2 is 
defined for the purpose. 

Every entry ({Ri, R2}, {R3, Ra}) of A 2 consists of four formal sums satisfying the 
parallelogram law 2(Ri + R 2 ) = R3 + Ra- Hence, it corresponds to the subgraph 
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Table 6: Enumeration algorithm for three-dimensional lattices constructed from q- 

values. 

void enumerateThreeDimLattices(A obs , c, det S m in, detS max , Ans) 
(Input) A obs ,c : same as in Table [5] 

detS m i„, detSmax ■ threshold on minimum and maximum determinants of 
output matrices. 

(Output) Ans : array of 3 x 3 positive-definite symmetric matrices. 

(1) By the method in Table [SJ enumerate ({q r , q s }, {qt, Qu}) of A obs satisfying the 
parallelogram law and insert in A2. (Here, A2 is an array of four formal sums 

({Qi,Q 2 },{Q3,Q 4 }).) 

(2) Enumerate four elements ({q r , q s }, {(It, qu}) of A o6s satisfying the equation 3q r + q t = 
3q s + q u . This is done by a similar method as in Table [5J Using a new formal sum 
q-i := - 2 ir+qs+q u _ q r -2q s +q t ^ ^ Q getg Q j g_ vaules satisfying the parallelogram law 

are generated from such ({q r , q s }, {qt, Qu})- 

2(q-i + q r ) = q s + q u , 2(<?_i + q s ) = q r + q t - (90) 

Check whether A 2 contains ({q w , q r }, {q s , q u }) or ({q w , q s }, {q r , q t }) for some 
1 < w < Np ea k. If this is false (this suggests that q-\ := ~ 2grH 2 ' ?3+ ' ? " = 
q r -2q 3 +q t j g unc [ e tected in A° bs owing to some observational reason), insert 
({ - 2qr+ 2 9 ' +Qu ,qr},{qs,q u }) and ({ 3r=**±3t , gs} , qt} ) in A 2 . 

(3) for every entry Q 2 }, {Qs, Qi}) € A 2 , insert (Qi, Q 2 , Q 3 , Qi), (Qi, Q2, Qi, Q 3 ), 
(Q2, <9i, Q3, Qi), (Q2, Q\,Qi, Q 3 ) in a new array A 3 . 

(4) For every (Qi, Q 2 , Q 3 , Qi) G A 3 , search (Ri, R 2 , R 3 , R4,) € A 3 satisfying either of 
the following: 

(a) Qi=R x e A obs . 

(b) Qi,Ri£ A obs and |Val(Qi - Ri)\ < cErr(Qx - i?i). 

In addition, for every q^ £ A obs , assume that there exists 11, l 2 , 1% satisfying" 

Qi~Ri = \h\ 2 , Q2 = \h\ 2 , Q 3 = \h + h\ 2 , 

R2 = \h\ 2 , R 3 = \h + h\ 2 , q k = \h +h + h\ 2 - (91) 

Then, the Gram matrix S :— (U ■ lj)i<i,j<3 is obtained as the following 3x3 
symmetric matrix: 

Qi KO3-Q1-Q2) |(iZs - Qx - R2) \ 

UQ3-Q1-Q2) Q2 hiQi-Qs-Rs + qk)}, (92) 

}(R 3 -Qi -R 3 ) \{Qi-Qz- R 3 + q k ) R2 J 

using Val and Err, the values and propagated errors of the entries are computable. 
If AetSmin < detS < detS max , insert S in Ans. 
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"Every three graphs on the left-hand side have a common node, which is an end point of three edges 
associated with {Ri, R2}, {Rl, R4}, and {R2, Ri}- This figure illustrates how these graphs are unified. 

Figure 13: Extension of a topograph (1/2). 




"The left-hand two subgraphs contain the subgraph in Figure ?? commonly. They are unified as 
above. 

Figure 14: Extension of a topograph (2/2). 

of a topograph as in Figure [2] Table [JJ explains how a subgraph T of a topograph 
is constructed from these substructures. The elements of A2 utilized to obtain the 
subgraph are output in A2. 

In the procedure of Table [7J the subgraph T is expanded to only one side. The 
whole subgraph composed of e := ({Ri, R2}, {R3, R4}) G A 2 and entries of A 2 is ob- 
tained by calling the recursive procedure twice, setting e := ({i?i, R2}, R3, Ri) and 
({i?i, R 2 }, R4, R3) respectively in the second argument. Let C(e) be the cardinality of 
the union of the two A 2 output by calling the recursive procedure twice as above. This 
C(e) is considered to quantify the size of the subgraph finally obtained. 

If either (CHJ or (Cj2]) is wrong for e E A2, C(e) will result in a small number, 
because the new q- values required to extend a new edge (i.e., q E A obs in line 2 of Table 
[Jj would rarely be found in A obs . This suggests C(e) provides an effective sort criterion 
for elements of A 2 . Part (b) of Theorem [3] indicates that this criterion remains effective 
under the influence of systematic absences. 

Now any e := ({Ri, R 2 }, {R3, R4}) G A 2 corresponds to a 2 x 2 Gram matrix S(e) G 
S 2 by the following map: 

«<««,,*},{*, *}» - _ Rl) iv "%^ - ■ w 

When C(e) is used as a sort criterion, e G A2 with smaller detS(e) is prioritized as 
a result, because A obs C [q m in,q m ax] contains a smaller number of t uS(e)u (u £ Z 2 ) if 
det5(e) has a larger value. This property of C(e) is also desirable, because ((U) in Table 
|6](and Theorem |4]) requires {I*, I*} G Pa(i*) (i = 2,3) to obtain the true solution L* . 
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Table 7: Recursive procedure to form a subgraph of a topograph using e and elements 

otA 2 . 

void expandSubtopograph(j4 2 , e, A 2 , T) 

(Input) A2 : array of four formal sums ({Qi, Q 2 }, {Q3, Qi]) with Qi :— 
^2i=\ ak n ijQj- Furthermore, it is assumed that every entry sat- 
isfies 



Val{2Q 1 + 2Q 2 ~Q 3 -Q 4 ) < c min{2Err(Q 1 + Q 2 ),Err(Q 3 + Q 4 )}, 

for some fixed constant c, and at least three of Qi (1 < i < 4) 
belong to A obs (i.e., there is q G A obs such that = <?). 
:= ({i?!,^},^,^) satisfying ({R u R 2 }, {R 3 , R 4 }) e A 2 . 
: subset of A 2 . 

: subgraph of a topograph composed of substructures corre- 
sponding to ({Qi, Q2}, {Q3, Q4}) € A2. (If such subgraphs are 
not unique, A 2 containing larger number of entries is prioritized.) 



1: (Start) For = (1, 2), (2, 1), let Sy be the set defined by 

2 ^ [{({i^iM^^e^igeA^} ifi?,GA o6s , 
10 otherwise. 

3: T12 := 0. A12 := 0- T 21 := 0. i 21 := 0. 

4: for i = 1 to 2 do 
5: Take je {1,2} 

6: for e 2 € 5^ do 

7: Call expandSubtopograph(A 2 , e 2 , Ty, Ay). 

8: if Aij < \Aij\ then 

q- f . — t 

10: A{j := A^ . 

11: end if 

12: end for 

13: end for 

14: Seti 2 ■.= {({R 1 ,R 2 },{R 32 R 4 })}UA 12 UA2i. 

15: Construct T by unifying Ti 2 , T 2 i and ({Ri, R 2 }, {R3, R4}), as in Figure [T51 



"In the actual computation, it is not necessary to construct a subgraph T, because a sort criterion for 
elements of A2 is defined using only A2. However, as T has a data structure of binary trees consisting 
of finite nodes and edges, it is not difficult to implement T in a program. 



e 

(Output) A 2 
T 
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Using C(e), a sublattice L* expanded by {l*,l*} G P 2 (L*) is prioritized over any other 
sublattices of L* of rank 2. 

To summarize the above discussion, it is only necessary to insert the following pro- 
cedures between @ and in Tabled in order to reduce the computation time. 

(2i) For each element e := ({Ri, R 2 }, {R3, R4}) & A 2 , compute C(e) by calling the 
procedure in Tabled (In this case, any e 2 & A 2 satisfies C(e) = C(e 2 ). Hence, 
the number of calls of the procedure is less than the size of A 2 .) 

(2ii) Sort A 2 in descending order of C(e). For e\,e 2 € A 2 satisfying C(ei) = C(e 2 ), 
they may be sorted by e\ < e 2 <^=> det5(ei) < detS(e 2 ). 

def 

(2iii) Remove the (N zone + l)th-to-last elements of A 2 using a fixed integer N zone > 
0. (The default value of N zone used in Conograph is given in of section 
18.11 When the default value is used, the computation time is approximately 
proportional to N pea k A -) 

In Section 18.21 we shall see how the computation time is decreased in practice by 
this improvement. 

7.4 Problems with the quality of powder diffraction patterns 

Among the problems described in (Al)-(AQ) and (^SJ) of Section [31 we have not yet 
clarified how to handle . In this section, we explain how missing and false elements 
in A obs influence the powder auto-indexing results. This issue is related to the quality 
of powder diffraction patterns as observational data. 

Using a peak search program equipped with Conograph, it is not difficult to obtain 
the positions of all the peak heights above a given threshold automatically and rather 
uniformly. Although the ability to decompose overlapping peaks depends on the peak 
search software, this is not a great problem in our algorithm, because an almost identical 
solution will be obtained even if qi 6 A obs is replaced with q 2 very close to q\ (to some 
degree, Err[qi] and Err[g2] will absorb the difference between q\ and q 2 ). 

By the algorithm in Table [51 normally, multiple Gram matrices of the reciprocal 
lattice L* of the correct solution L are generated from A obs . This is because a lattice 
L* has as infinitely many Gram matrices as bases of L*. Note that Gram matrices of 
different bases are computed from different g-values basically. Therefore when these 
Gram matrices are transformed into a reduced form, they are a bit different matrices 
owing to observational errors, but rather close to each other. 

In order to reduce the influence of ei > of (A^j, the existence of such duplicate 
solutions is very useful. Suppose that m Gram matrices of L* are generated from the 
set of q- values A ext (p) PI [q m in,<lmax\ containing no observational errors by applying 
our enumeration method. Then, when the same method is applied to A obs containing 
observational errors, the enumeration process fails to obtain the correct solution, only 
when none of approximate solutions of the m Gram matrices them are generated. The 
failure rate becomes very small as m increases, regardless of the magnitude of e±. If 
the size of A obs is augmented, or equivalently the range [qmin,Qmax\ is magnified, m 
increases naturally. As a result, the enumeration success rate increases monotonically 
as the size of A obs increases. However, it should be noted that the time for enumeration 
of solutions is roughly proportional to the fourth power of the size of A obs , if the default 
parameters of Conograph are used. In addition, the success probability will not increase 
much once q max reaches some observational limit, because larger q- values have larger 
errors owing to peak overlap and observational accuracy. 
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Next, we discuss the influence of €2 in (421) • in this case, the success rate in obtaining 
the true solution remains same if A obs is replaced with A ohs U {q obs } which contains a 
false element q obs . (Of course, such q obs increases the time for enumeration.) 

In conclusion, the enumeration process is considered to be robust to missing and false 
elements in A obs , although such elements increase the computation time. Indeed, the 
procedure to sort candidate solutions executed after the enumeration is more sensitive 
to missing and false elements, because figures of merit are severely affected by these 
elements. In general, for a poor quality powder diffraction pattern, it is very difficult to 
judge which solution is correct, even if the q- values of respective solutions are manually 
compared to actual peak-positions (see Example |5] in Section 1572)) . 

Example [5] shows the case of a two-phase sample. Both lattice parameters are ac- 
quired by Conograph. However it seems to be almost impossible to judge which is the 
correct second phase parameter in this case. 

8 Implementation and results of Conograph 

Before introducing the input parameters and results of Conograph, we explain the cir- 
cumstances in which the program was verified. The Conograph source code is written 
in C++ and OpenMP, and compiled with Mingw (GNU Compiler Collection for Win- 
dows). The computer used for the test has an Intel i7-2620 (2.70 GHz) processor and 12 
GB RAM. The processor can execute parallel computing with eight hyper-threads, all 
of which were used during the test. Additionally, we confirmed that a computer with 4 
GB RAM was able to carry out the same test. 

8.1 Input parameters of Conograph 

The input parameters used in the enumeration process are listed in Table |8j In partic- 
ular, "AUTO" is used to set parameters that depend on respective powder diffraction 
patterns. The parameters required after the enumeration are not given here; they will 
be introduced at another time. 



Table 8: Default parameters of Conograph. 



Symbol 

A20 

c 



N, 



peak 



N z 
N. 



sol 



Vol„ 
VoL 



Meaning 

Zero-point shift (degree) 

Tolerance level for errors in g-values 

Characteristic X-rays or neutron reactor sources 
Synchrotron X-rays or neutron spallation sources 

Number of g-values used 

Threshold for the maximum number of ({Qi, Q2}, {Q3, Qi}) 
Threshold for the maximum number of solutions 
First trial 

When the first trial failed because of too many solutions' 1 
Threshold for the minimum volume of R 3 /L (A 3 ) 
Threshold for the maximum volume of R 3 /L (A 3 ) 



Default 

0. 

1.5 
1. 

AUTO 
AUTO 

AUTO 

64000 x m (to e Z >0 ) 

AUTO 

AUTO 



a N so i is set to 64000 at most when AUTO is used, in order to prevent memory allocation errors in 
memoryless computers. When the number of enumerated solutions exceeds N ao i, Conograph removes 
those with a smaller unit-cell volume. As a result, the true solution is sometimes also removed. The 
simplest method of resolving this is to reduce Vol ma:E or increase N ao i when unsatisfactory results are 
obtained. This time, we chose the latter in order to measure computation time. However, such a trial- 
and-error approach is not necessary, because an alternative method is adopted in Conograph. This will 
be introduced at another time. 
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In the following, we provide an explanation of the parameters and formulas needed 
to compute AUTO. 

(1) Zero-point shift A29. Depending on the type of diffractometers, x-coordinates of 
a powder diffraction pattern is given as the following function of q- values. 

26> = 2sin- 1 ^^ + A20, (94) 

n 

Time-of-flight = ^c i9 ~ l/2 . (95) 

i=i 

Using these equations, x-coordinates are transformed to g-values before executing 
powder auto- indexing. Among A, A29 and q (1 < i < n), only the value of A29 
is unknown. Conograph users are recommended to set A29 = normally, because 
successful results were obtained even in cases of very large zero-point shift, such as 
A29 w 0.2 degree. After auto-indexing, it is possible to refine the lattice parameters 
and zero-point shift simultaneously using a nonlinear least-squares method. 

(2) Tolerance level c for errors in q- values. Table [5] provides a usage example of c. 
By setting a larger value of c, a wider range of combinations of q- values is searched. 

(3) Number of q-values used. This parameter determines the number of g-values 
in A obs . After sorting the q- values in A obs into ascending order, the (N pea k + l)th- 
to-last parameters are removed before the powder auto-indexing commences. As 
explained in Section 17.41 both the computation time and success rate increase as 
Npeak is magnified. The default value of N pea k is calculated by: 

N peak := min{tt{q< 10/d 2 : q e A ohs },48}, (96) 

where )JT is the cardinality of the set T, and d is the minimum threshold for the 
distance between two lattice points. In Conograph, d is set to 2.0A 

In (j96|) . Np ea k < 48 is forced, because it is frequently meaningless to use large q- 
values owing to severe peak overlap. However, 48 is still much larger than the 20-30 
that are normally adopted for powder auto-indexing. Conograph uses such many 
q- values for another reason; in (-4SJ) an d the paragraphs following (v^BJ , we explained 
that q m ax > Djy := 3-2 _1 ' 3 d -2 ps 0.6A~ 2 is required at least theoretically. (In ([Mil . 
qmax > 4Djv is adopted.) Figure [TBI presents the range of the first N pea k q- values in 
the test data of Table [TBI Owing to the threshold of 48, the interval [q m in, q ma x] is 
frequently much smaller than 0.6A -2 , although powder auto-indexing succeeded in 
all the test data. We suppose 48 g-values might be insufficient in some exceptional 
cases. Users are recommended to increase N pea k manually, if results obtained with 
the default parameters are unsatisfactory. 

(4) Threshold for the maximum size of ({Qi, Q2}, {Q3, Qa})- 

N zone := ^Npeak{Npeak + 1). (97) 

(5) Threshold for the maximum number of candidate solutions. 

N sol := min{64000, N 2 zone }. (98) 

(6) Threshold for the minimum and maximum of the volume of M 3 /i. 

Vol mi „ := max{5,u^ 1 }, (99) 
Vol mQX := 30Vol mm , (100) 
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— Synchrotron 



J- Characteristic X-rays 

} Spallation neutron src 
(Back scattering bank) 

— — ; } Reactor 

1 — i — i — i i 1 1 1 1 1 — i — i — i 1 1 1 1 1 1 — i — i — i i 1 1 1 1 1 — i — i — i 1 1 1 1 1 

0.001 0.01 0.1 06 1 10 

q (A-2) 

"If <lmax > 0.6A~ 2 is imposed, the interval [qmimlmax] often includes more than several tens of q- 
values of diffraction peaks. We should not increase q m i n , because smaller q- values have better accuracy. 
As a result, \q m i n , q m ax\ set by the default parameters often does not satisfy the theoretical requirement 
Qmax > 0.6A -2 . Nevertheless, powder auto-indexing succeeded in all the test data. This is considered 
to be because all test data satisfy d > 2.7 A (and many also satisfy d > 4A), and because the formula 
Jill) of Lagarias et al. provides an overestimation of D3 . 

Figure 15: Range [q m ini Qmax] of N pea k q- values used in powder auto-indexing. 



where 5A 3 is chosen as the threshold for the minimum volume of existing crystals, 
and V20 is the upper bound of Vol(IR 3 /L*), estimated using the 20 smallest elements 



of A° bs :={qi,q 2 ,- 



,qN F 



J by: 



2tt q 



3/2 3/2 



(101) 



J 3 j-1 

Equation (I101[) is based on the following formula, which holds for any < r < R 



2tt{R 3 - r 3 ) 



{r 2 < |r| 2 <R 2 :l* e L*} 



> 



4ir(R 3 -r 3 ) 



{I* eL* :r 2 < \l*\ 2 < R 2 } 



Vol(R 3 /£*) as R -> 00. (102) 



Note that 20 and 30 are chosen empirically, and VN Psak is used instead of V20 if 
Npeak < 20. We have found no cases when this [Vol m i„, Vol max ] fails to contain the 
correct volume. 



8.2 Results 

We prepared 26 + 4 powder diffraction patterns as test data. A summary of the first 
26 test data is presented in Table [9] The remaining four are presented in Examples [3H6] 
to illustrate some rather special cases. 

We now evaluate how the method in Section 17.31 improved enumeration times. In 
Section 17.21 we explained that the time is roughly proportional to N^ one . From the 
formula (|97[) . the computation time of the algorithm in Table [5] is approximately pro- 
portional to Npeak 4 . Figure [TBI illustrates the relation between the following pairs in 
practical tests: 

• number of (/-values N pea k and time for enumeration, 

• number of q- values N pea k and total time for powder auto-indexing, 

• number of enumerated solutions and time for enumeration, 
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Table 9: Summary of 26 test data' 



Diffractometer 


number of patterns 


Synchrotron 


11 


Characteristic X-rays 


7 


Spallation neutron sources (time-of-flight) 


6 


Reactor neutron sources 


2 


Symmetry of lattice 




Triclinic 


7 


Monoclinic (P) 


4 


Monoclinic (B) 


1 


Orthorhombic (P) 


5 


Tetragonal (P) 


2 


Tetragonal (I) 


1 


Rhombohedral 


1 


Hexagonal 


1 


Cubic (I) 


1 


Cubic (F) 


3 


Distribution of absolute zero-point shifts b 




- 0.1 


17 


0.1 - 0.15 


1 


0.15 - 


2 



"Among eight samples distributed in SDPDRR-2, samples 1-4, 8 are included herein. The result 
of sample 7 is used in Example [6] Samples 5, 6 are excluded this time, because we could not obtain 
solutions with de Wolff figures of merit M20 > 10 using peaks with sufficiently large intensities. These 
patterns seem to contain a considerable number of false peaks. 

'The zero-point shifts were computed after execution of powder auto-indexing by non-linear least 
squares method. Time-of-flight data are excluded here. 
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Number of g-values N pea i 



A 


Cubic 





Hexagonal 


X 


Rhombohedral 


□ 


Tetragonal 


O 


Orthogonal 


o 


Monoelinie 


+ 


Triclinic 



++ + 

&<t> 



a a 

I-TTTTT, | | 

io 3 io 4 

Number of enumerated solutions 



10 



300 ■ 
240 







+ 


A 


Cubic 




O 


Hexagonal 


+ 


X 


Rhombohedral 


O 


□ 


Tetragonal 


+ 





Orthogonal 




o 


Monoelinie 




+ 


Trielinie 





Number of q- values N pe 



A 


Cubic 





Hexagonal 


X 


Rhombohedral 


□ 


Tetragonal 


o 


Orthogonal 


o 


Monoelinie 


+ 


Triclinic 



A A 

n-TTTT, | | 

io 3 io 4 1C 

Number of enumerated solutions 



+o 



"Total time includes a) Enumeration of solutions, b) Transformation of every solution to a reduced 
form, c) Computation of figures of merit, d) Refinement of solutions using linear optimization, e) Error- 
stable Bravais lattice determination, f) Removal of solutions having a figure of merit less than a (user- 
input) threshold, g) Removal of duplicate solutions, h) Sorting solutions with a selected figure of merit. 
Of these, a) and e) are the most time-consuming. The time required for a) consumed approximately 
half of the total time. We have already contributed to reducing the time for e) in I23| . 



Figure 16: Computation time of Conograph. 



• number of enumerated solutions and total time for powder auto-indexing. 

Figure [T7] shows the rate of decrease in the size of A-i as a result of applying the 
method described in Section 17.31 

Even in the following difficult cases, solutions were obtained without special parame- 
ter settings. By Conograph, reliable powder auto-indexing results will become available 
even for less-experienced users. 

Example 3. Non-unique solutions (Figure 1181 (a)). For any fixed C > 0, the 
following lattice parameters have exactly the same q-values (cf. [3J in Avvendix[B\) . 

Cubic(P) : a = b = c = C,a = /3 = 7 = 90, (103) 
Tetragonal(P) : V2a = V2b = c = C, a = (3 = 7 = 90. (104) 

Conograph succeeded in finding both of these. 

Example 4. Small q- values are lost (Figure 1181 (b)). When the size of the unit 
cell is large, small q-values are frequently lost because q < q m in- We have confirmed that 
Conograph is very robust to such a loss. This example presents the case in which the 19 
smallest non-zero q-values are not included in the observed range [q m im Qmax] ■ 

Example 5. Two-phase data (Figure 1181 (c)). This powder diffraction pattern is a 
two-phase sample with the mass ratio 58 : 42. The lattice parameter of the second phase 
was also enumerated. 

Example 6. Poor quality powder diffraction pattern (Figure 1181 (d)). This is 
sample 7 distributed in SDPDRR-2. According to a recent personal report from Le Bail, 
crystal structures other than sample 7 have been determined. Conograph obtained three 
reasonable solutions. 



38 





1 - 












i- 










A 


Cubic 








A 










A 









X 


Hexagonal 
Rhombohcdral 


5' 
o 


J- 






A 






o 


A 








□ 
O 


Tetragonal 
Orthogonal 
Monoclinic 




2- 


A 


Cubic- 


A 







CM 2 . 


A 










Triclinic 






O 


Hex agonal 




X 




x O 














X 


Rhomb ohedral 














oo 


+ + 






0.1 - 


□ 

O 


Tetragonal 
Orthogonal 






i 


fel 0.1, 







% 

+ 


+ 

+ 


O 








o 


Monoclinic 


















O 




6- 
5- 


+ 


Triclinic 








5- 




o 









10 20 30 40 50 50 100 150 

Number of g-values N peak Time for enumeration (sec.) 



"Except for cases of small N pea k, all the rates are in the range 0.06—0.18. Under the assumption 
that the time for enumeration is proportional to N^ one , the enumeration is about 32-250 times faster 
as a consequence. The right-hand figure indicates that a larger improvement was made in more time- 
consuming cases. 

Figure 17: Rates of elements of A2 used in enumeration algorithm. 

9 Conclusion 

Powder auto-indexing is divided into two main stages: enumeration and sort of solu- 
tions. We contributed mainly to the stage of enumeration by providing a quick and 
strongly reliable algorithm. For the purpose, Conway's definition of topographs was 
generalized to lattices of any rank TV using Voronoi's second reduction theory, holding 
the association of edges and four lengths 1^1, |Z||) |^i \^i ('±^2 e L*). By 

using common properties of systematic absences proved in Theorems [TJ [3] and HI the al- 
gorithm is shown to work regardless of the type of systematic absences. This properties 
are stated as distribution rules for reciprocal lattice vectors corresponding to systematic 
absences on a topograph. Such rules have not been known so far, and will be also use- 
ful in other problems of crystallography. Our enumeration algorithm was implemented 
in Conograph. Conograph obtained successful results, even for difficult cases. These 
examples proved that the new enumeration method is robust against missing and false 
elements in the set of lattice vector lengths extracted from a powder diffraction pattern. 
Topographs were also utilized to speed up our enumeration algorithm. In practical tests, 
we found that the improvement reduced the enumeration time to 1/250-1/32. 
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(a) Case of more than one solution. 



Cubic(P) 
Tettagonal(P) 



iiiiiii ii in i 1 1 1 1 1 



(c) Two-phase data. 



Time of flight 



J 




(b) The 19 smallest ^ q e A L » are lost. 




Time of flight 

(d) Poor quality powder pattern. 




Time of flight 



26 Meg.) 



"Triangles indicate peak positions detected by a peak-search program (corresponding to q-values in 
A obs ). Tick marks represent the peak positions corresponding to q £ A/,* := {|i*| 2 : 2* £ L*}. It is seen 
that two different lattices have exactly the same q- values. Both parameters are detected by Conograph. 

'This example presents the case in which the 19 smallest non-zero g-values are not in the observed 
range. It is confirmed that Conograph is very robust to such a loss. The lattice parameters are a = 8.82, 
b = 9.76, c = 9.78, a = 7 = 90, = 104, and a diffraction pattern from a back-scattering bank of 
neutron sources is used. 

c This powder diffraction pattern is a case of a two-phase sample with the mass ratio 58 : 42. 



Upper : a = b = c = 12.0 (Cubic(I) , M 20 = 7.0), 

Lower : a = b = 4.8, c = 13.0 (Rhombohedral, M 2 o = 1.4). 



(105) 
(106) 



The unit cell of the first phase is much larger than that of the second phase. As a result, the lattice 
parameter of the first phase gained the best de Wolff figure of merit among all the enumerated solutions, 
regardless of peaks derived from the second phase. The lattice parameter of the second phase was also 
enumerated by Conograph. This supports our claim in Section 17.41 that the enumeration process is 
robust to missing and false elements of A obB . However, the figure of merit of the second phase was 
considerably small, owing to the first-phase peaks. Therefore, it is almost impossible to judge which 
one is the correct second-phase lattice. 

d This is sample 7 distributed in SDPDRR-2, held in 2002. The tick marks are peak positions of the 
following solutions output by Conograph: 



Upper 
Middle 
Lower 



a = 3.99, b = 11.5, c = 17.1, a = 77.9, p = 84.6, 7 = 80.4 (M 20 = 
a = 4.07, b = 22.5, c = 25.7, a = 88.8, = 87.4, 7 = 84.8 (M20 = 
a = 3.95, b = 17.1, c = 22.8, a = 78.4, =, 86.6, 7 = 84.1 (M20 



6.6) , (107) 

7.7) , (108) 
= 12.9). (109) 



The upper lattice parameter is the solution proposed by participants in SDPDRR-2. This regards the 
two leftmost peaks to be false. The others are newly found by Conograph. The second solution assumes 
that some peaks are embedded in background noise, and the third is intermediate. This result proves 
that Conograph makes the stage of enumeration much more reliable. Although the correct solution is 
provided by a larger M20 normally, this is not always right. In particular, it becomes very difficult to 
select the best solution from peak positions alone, when powder patterns are assumed to have a number 
of diffraction peaks embedded in background noise or false peaks as intense as diffraction peaks. 



Figure 18: Results of Conograph. 
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A On equivalence between powder diffraction pat- 
terns and average theta series 

For any periodic function p with the period lattice L satisfying J r n / L \p{x)\dx < oo, an 
average theta series & P (z) is defined by: 

e P (*) := vn]( uN /n £ / P(^)p(y) e 2 ^ |a; -^ |2 ^y, (no) 

VOl(K /^j^J(R« /L) 2 

where vol(M w /L) is the volume o£M. N /L. (Q p (z) is invariant if a sublattice L2 L is 
regarded as the period of the same p. This definition is a generalization of the average 
theta series defined in 2.3, Chapter2 of |S].) The infinite sum converges uniformly and 
absolutely in any compact subset of {z £ C : Im(z) > 0}. 

Using the Poisson summation formula, the functional equation for O p (z) is obtained: 



e - (z)= (V) £ e-^in 2 |p(D| 2 , (in) 



where p(l*) := vo\{R N / L)- 1 J RN /L p(x)e~^^ x - v dx. 

Assuming that f pow der(Qj p) i n © equals for any q < 0, the Fourier transform of 
(2 y /q)^ 1 j 'powder (<?; p) is an average theta series. 



f f P o wd eAq;p)e 2 ^^^- = [([ T \p(l*)\ 2 S(x* ~r)e 2 ^ lclz dx* ) 

= [ Y \p{l*)\ 2 6(x* ~r)e 27I ^ x *\ 2z dx* 



dq 



J2 e>^*l 2 *\p(n\ 2 = (-2V^z)- N eJ-±). (112) 



Therefore, information obtained from a powder diffraction pattern is theoretically 
equivalent to that from an average theta series. 



B Theorems on the cardinality of solutions 

It is well known that the equivalence class of So G Sy is not uniquely determined, even 
if all elements of As := {SjSoV : ^ d £ Z N } are provided. However, for N < 4, it 
is possible to obtain a finite set containing all the equivalence classes of S € Sy with 
A s = A So (cf. Appendix P. 

In this section, several known theorems about the cardinality of solutions are sum- 
marized for reference. 

1. Case N = 1. In this case, A p in ((SJ) generates L* over Z. (Otherwise, let L* 2 C L* 
be the lattice generated by A p . Then, the reciprocal lattice L2 of L\ is the period 
lattice of p, since p(x) — J2i»eL* p(l*)e 2jr ^~^ x ' 1 " holds. This is a contradiction.) 
Therefore, the determination of L is straightforward, even from A p . 

2. Case N = 2. It was shown by Delone (and independently by Watson [33], [55] ) 
that, up to a factor, the following is the only pair of inequivalent positive-definite 
symmetric matrices that have the same representations over Z. 
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3. Case N = 3. From the case of N = 2, an infinite family of inequivalent pairs that 
have the same representations over Z is obtained. 

2 1 0\ /2 0\ 

1 2 0,0 6 01. (114) 

cj \0 c) 

The following is another example (see [5D] for the proof). 

1 0\ /l 0\ 

1 0,0 2 . (115) 
1/ \0 2/ 

A result from Conograph for this case is presented in Example [3] of Section 18.21 

4. Case N = 4. S € is called universal if Ag equals Z>o, the set of all positive 
integers. It was confirmed by Bhargava and Hanke that the number of equivalence 
classes of universal S G S y0 equals 6436 [JJ. 

5. Case N > 5. From the existence of universal quadratic forms in N = 4, there are 
infinitely many S £ <S^ such that As = Z>o. 

C Lattice determination from a complete set of lat- 
tice vector lengths 

In this section, for any N < 4 and a given As C R>o of some So € Sy Q , an algorithm 
to enumerate all the equivalence classes of S G Sy satisfying As = A5 is introduced. 

First, we recall that S := (sjj)i<i,j'<JV £ is Minkowski-reduced if and only if 5 1 
satisfies 

Sij = min{ f ViSvi : {ei, • • • , e^—i, Vi} is a primitive set of Z^}. (H6) 

When S is Minkowski-reduced, the entries of S satisfy 

< flu < • • • < s NN , 2\ Sij \ < s u (1 < i < j < N). (117) 

Table [10] gives a recursive procedure to generate all the candidates for S from A obs := 
As H (0, c), when < c < 00 is large enough. If the recursive procedure is started 
with arguments m = n = I= J = l, all positive-definite symmetric matrices S := 
{sij)i<i,j<N satisfying the followings are enumerated in an array Ans. 

{six < ■■■ < s NN , 2\sij\ < su (1 < i <j < N), 
Sii+i <0(1<KN), (118) 
{s u :l<i< N} U {s u + Sjj + 2s i:j : 1 < i < j < N} C A obs . 

Consequently, any Minkowski-reduced S satisfying (| 1 18[) and 3s nn < <Im is output 
in Ans. As a result of Proposition lC.il the recursive procedure is always completed in 
a finite number of steps, even if we set c = 00, i.e., if A$ is used instead of A obs . This 
indicates all the equivalence classes of S € S N satisfying A5 = As are enumerated by 
the recursive procedure, if sufficiently large c is selected. Consequently, the number of 
equivalence classes of S G Sy satisfying A5 = Ag is finite for any As C M >0 . 

In the remainder of this section, we give a proof of Proposition lC.il 

Proposition C.l. Suppose that S € and S 2 <E «S$j and < N 2 < N < 4. Then, 
As 2 3> As- 
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Table 10: Recursive procedure for lattice determination from a perfect set of represen- 
tations. 



(Input) 



void enumerateLattice(iV, A obs ,m, n, S, I, J, Ans) 



A ob 



1 < N < 4 



(Output) 
1: (start) 
2: 
3 
4 
5 
6 
7 
8 
9 
10 
11 

12 

13 
14 
15 
1G 
17 
18 
19 
20 
21 
22 
23 
24 
25 
2G 
27 
28 
29 



to, n 
5 := (s 



I, J 



Ans 

for I = I to J do 
if m = n then 



ft- 



else 



J mn — J nm 2 v 

end if 

if to = 1 then 

if det(sy-)i<ij<„ > then 
if n > N then 

Insert S in Ans. 
else 

TO2 a := max i 1 < i < M 



number of rows and columns of So € Sy , 

a sorted sequence of all the elements of As that belong to 

the interval (0, c), 

integers 1 < to < n < N indicating the (to, n)-entry of S, 
N x N positive-definite symmetric matrix that fulfills 
< sii < I < s u (1 < % < j < n), Stf+i < 

(1 < i < n). 
integers indicating 

J s nn e {ft : I < i < J} if m = n, 

|snn = 9/ and s mm + s nn + 2s mn € {ft : / < i < J} otherwise, 
array of positive-definite N x N symmetric matrices. 



9 (ft Smm Snn)- 



qi, ■ ■ ■ , ft_i are representations of 
the submatrix {sij)i<i,j<n over Z 



if to = n then 

Call searchLattice(7V, A ohs , n + 1, n + 1, 5 1 , 1, to 2 , Ans). 
else 

Call searchLattice(iV, A obs , n + 1, n + 1, 5, 7, m 2 , Ans). 
end if 
end if 
end if 



else 

if m = n then 

m 2 := max{l < i < M : qi < s„_i„_i + s„„}. 

Call searchLattice(iV, A obs , to — 1, n, S 1 , /, to-2, Ans). 
else 

m 2 := max{l < i < M : ft < 2s m _i TO _i + s„„}. 
Call searchLattice(iV, A obs , to — 1, n, 5, 1, m 2 , Ans). 
end if 
end if 
end for 



"Proposition ICTTI claims that there always exists m,2 < oo, even if the sequence A obs is replaced by 
A So virtually. 
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Lcmma lC.ll is utilized in the proof. 

Lemma C.l. Any S £ Sy is represented as a finite sum J2k ^kSk such that every Sk 
is a positive-definite symmetric matrix with rational entries, and Xk G K>o we linearly 
independent over Q. 

Proof. For any S := (sij)i<i.j<N € S N , let v$ be the vector '(sn, S12, • • • , sy, • • • , swat) 
of length N • Then, 5^ is identified in w ^ + 1 ^ - dimensional vector space by the 
map S" i— » W5 . Define a set P by: 

P := {/ C {sij : 1 < i < j < N} : sij € / are linearly independent over Q}(119) 

Then, P is not empty. Let {t\, ■ ■ ■ ,t m } be one of the maximal elements of P under 
inclusive order. When vectors \t±, ■ ■ ■ , t m ) and '(1, • ■ • , 1) of length to are denoted by t 
and l m respectively, there exists an N (^ +1 *> x m rational matrix C such that vs = Ct. 
Furthermore, there exists e > such that for any U :— (iiij) G GL m (M) with entries 
< e, every column of C(t*l TO — U) is the image of a positive-definite symmetric 
matrix by the map S > v$- 

Let l m be the identity matrix of size m. If t l m U~ 1 t 7^ 1, we have equations: 

(tXn-u)- 1 = u-'iCimU-h-iyhX.u- 1 -i m ), (120) 
(tXn-uyh = Ci m u-h - i^u-h. (121) 

If all the entries of U~ 1 t are negative, we have that t l m U~ 1 t < 0, and every entry of 
(t'l m — U)~ 1 t is positive. Clearly, there exists U := € GL m (M.) such that \uij\ < e, 
every entry of U~ 1 t is negative, and the matrix t*l m — is rational. Fix such a [/. 

Let S'fc (1 < fc < to) be a positive-definite symmetric matrix satisfying C(t*l m — 
U) = (vsu ■ ■ ■ > v S m )- S is then represented as a linear sum of rational Sk with positive 
coefficients as follows: 

v s = Ct = (v Sl ,--- , V s m )(t t l m -[/)- 1 t. (122) 

Hence, the statement is proved. □ 

For any ring i?2 C R and a symmetric matrix S with entries in R2 , let A5 (R) be the 
set { l vSv : v E R N } consisting of representations of S over R. If € Ag(P), 5 is 
said to be isotropic over R. Otherwise, S is anisotropic over P. 

Proof of Provosition \C . 1\ The statement holds if it is true when 7V2 + 1 = N = 4. By 
Lemma lC.li it is sufficient if the statement is proved in the case that S, S2 are rational. 
Any non-singular quadratic form over Q p of rank 4 satisfies As(Q p ) D Qp for any p. On 
the other hand, any anisotropic quadratic form over Q p of rank 3, there exists a finite 
prime p such that As 2 (Q p ) 7$ , (c/. Corollary 2 of Theorem 4.1 in Chapter 6, [5]). If 
A S2 D A s , A 52 (Q) D A S (Q), therefore A S2 {Q P ) D A S {Q P ) is required for any p. This is 
a contradiction. □ 
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